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Abstract 

The interactions of electroweak gauge bosons are severely con- 
strained by the symmetries inferred from low energy observables. The 
exploration of the dynamics of electroweak symmetry breaking re- 
quires therefore both an experimental sensitivity and a theoretical 
precision that is better than the natural scale of deviations from mini- 
mal extrapolations of the low energy effective theory. Some techniques 
for obtaining improved theoretical predictions for precision tests at a 
future e + e~ linear collider are discussed. 
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Preface 



Several excellent, up-to-date reviews of the wide range of physics that can 
be probed at a future e + e~ Linear Collider (LC) are available @, 0j- The 
present review discusses some physical and technical aspects of electroweak 
gauge bosons, that will be important at a LC. The importance of gauge 
invariance even in a conservative Effective Field Theory (EFT) approach, 
which does not pretend to be valid to arbitrarily short distances, is em- 
phasized. The implementation of gauge invariance in perturbation theory 
is discussed. Techniques for improving the convergence and speed of high 
dimensional phase space integrations are described. 
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Introduction 



1.1 The Art and Craft of 
Elementary Particle Physics 

To teach how to live without certainty, and 
yet without being paralyzed by hesitation, is 
perhaps the chief thing that philosophy, in 
our age, can still do for those who study it. 

[Bertrand Russell: A History 
of Western Philosophy] 

In a highly idealized scenario, Theoretical Elementary Particle Physics (TEPP) 
proceeds by condensing the results of experimental observations up to some 
energy scale into a model, which is designed to remain consistent at higher 
energies. In a second step, predictions for experiments at these higher en- 
ergy scales are derived. Finally, comparisons with subsequent experimental 
results are used to refine the model, if necessary, and the process starts anew. 

Therefore, we can identify two complementary tasks: model building on 
one hand and calculation of observables on the other. Below, we will focus our 
attention in the area of model building to the systematic procedure of EFT 
and its role in the modern interpretation of the Standard Model (SM) with 
simple extensions. The methods for the calculation of observables will be 
discussed in greater detail. Such calculations are naturally divided into two 
separate parts: calculation of amplitudes and differential cross sections and 
phase space integration. Some methods for both parts are described with a 
noticeable bias towards the author's research interests. 
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Figure 1.1: Physics reaches of past, present and future colliders. Hadron 
colliders can not utilize the full beam energy in partonic collisions since the 
momentum is shared by quarks and gluons. As a rule of thumb, the lower 
border of the grey rectangles ( \fs' = y/s /10) corresponds to the physics reach 
of hadron colliders. (Data taken from / |7I| , \T^j.) 



1.2 Success and Failure of the Standard Model 

Tout comprendre, 
c'est tout mepriser. 
[Friedrich Nietzsche] 

In the decade after the commissioning of LEP1, many aspects of the Stan- 
dard Model || of elementary particle physics have been tested successfully 
to a precision previously unimaginable [|7|, ||. Indeed it is not without irony 
that one of the main sources of theoretical uncertainty derives from the in- 
sufficient precision of the old data on hadron production in e + e~-collisions 
at a few GeV, despite recent theoretical progress in the interpretation of this 
data M. 



The important qualitative features of the SM were established from 
low energy experiments at a time before any of the colliders shown in fig- 
ure |1.1| were commissioned. The symmetry structure of the SM as a SU(2)^ x 
U(l)y gauge theory that is spontaneously broken to the U(l) of Quantum 
Electrodynamics (QED) has been firmly established by the stunning success 
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Figure 1.2: The famous "blue band" derived from fits of electroweak precision 
data to SM predictions jj^J, that suggests a very light SM Higgs particle, with 
a mass close to the present exclusion limit. The area excluded by direct 
searches is shaded in light grey on the left. The parabolas show the distance 
in likelihood from the best fit as a function of the Higgs mass for two different 
determinations of the running electromagnetic coupling o.qed{Mz)- 



of the SM at LEP1. Nevertheless, direct information regarding the underly- 
ing dynamics of the Electro Weak Symmetry Breaking (EWSB) has yet to 
be found. 

The existing data is compatible with the minimal SM with a light ele- 
mentary Higgs particle ||. Global fits of the SM predictions to the LEP1 
precision observables (including m top from Tevatron and My/ from LEP2) 
suggest a Higgs mass close to the present exclusion limits (see figure |1.2| ). 
However, the Higgs mass enters the precision observables only logarithmically 
and the limits are therefore weak. 

Despite the phenomenological hints in its favor, it must be mentioned 
that the minimal SM with an elementary Higgs particle is unattractive from 
a theoretical point of view. There is a quadratically divergent contribution 
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to the Higgs mass renormalization 

^m ggs = _Q_ oc ^A 2 (1.1) 
that requires an unrealistic fine-tuning of the mass to one part in 10 38 in order 



to keep the Higgs light and the EWSB scale close to 1 TeV (see section |A.2.5 
for more details). 

There are two ways around this problem and both feature fermions promi- 
nently. A supersymmetry can relate the scalar mass to a fermion mass. Since 
a fermion mass is protected from renormalization by chiral symmetry, it can 
be naturally light and the scalar mass can naturally remain on the order of 
the supersymmetry breaking scale. On the other hand, Goldstone bosons 
from the spontaneous breaking of a continuous symmetry are protected by 
Goldstone's theorem and can be naturally light on the order of the symmetry 
breaking scale as well. Here the chiral symmetry of fermions is required to 
protect the particles that take part in the interaction, which is responsible 
for the symmetry breaking, from a large mass renormalization. The latter 
models are disfavored by current precision data. 

A necessary, though not sufficient, condition for the unification of gauge 
interactions in a single simple group at a Grand Unified Theory (GUT) scale 



is that the running couplings (see sections |A.2.8| and |A.3.2j for details) have 



a common value at one scale. According to the data from the pre-LEPl 
era, the SM has this property. However, LEP1 precision data revealed that 
the coupling do not meet, unless supersymmetric partners are added to the 



particle spectrum [15 



Even if the naturalness problem is solved by the Minimal Supersymmetric 
Standard Model (MSSM) and gauge coupling unification is achieved, models 
with an elementary Higgs particle remain unsatisfactory, because they not 
only provide no insight into a theory of flavor Jl6| , they also screen any theory 



of flavor so effectively that experiments at the GUT scale would be required 
to shed any light on the open questions of the physics of flavor, in particular 
the ratios of fermion masses and the fermion mixing angles. 

A strongly interacting weak sector that generates EWSB dynamically has 
a greater potential of explaining the physics of flavor. In such a theory it is 
conceivable that everything can be calculated from (renormalized) Clebsh- 
Gordan coefficients in a tower of spontaneous symmetry breakings. Such a 
reduction from the continuous manifold of free parameters to a discrete set 
of symmetry breaking patterns allowed by group theory would increase the 
predictive power of TEPP immensely. 

Today we have no working theory of flavor that could explain the ratios 
of fermion masses and the values of the mixing angles. We do not know if 
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Figure 1.3: W + W~ -production cross section ||/ close to the threshold is in 
spectacular agreement with the SM prediction. Already at 183 GeV ; the need 
for a W + W~Z° vertex was clearly visible. 



Nature was kind enough to hide the answer at the TeV-scale. Of course, she 
is under no obligation to do so and could as well have used supersymmetry to 
put the answers out of our experimental reach somewhere between the GUT 
and Planck scales. The remarkable success of the MSSM with no intermediate 



mass scales and gauge coupling unification |L5| might indeed have been the 
first direct evidence of such a grand desert. 



1.3 The Mission of the Linear Collider 

From the previous section, we can identify the main missions for a future 
LC. Assuming that neither LEP2, TEV33, or LHC have done so, the LC 
has to try to discover the Higgs particle and supersymmetric partners of the 
particles in the SM. If a Higgs particle has already been found at the time of 
commissioning, the LC will be in a unique position to determine its nature 
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Figure 1.4: Combined LEP2 and Tevatron limits on Triple Gauge Couplings 
at the time of the ICHEP98 conference in Vancouver, B.C. (July 1998) [PJ[]. 
The SM values are favored, but the 90% CL boundaries are still 0(1). See 
figure \T7h] for a dramatic improvement (note the different scale!). 



by measuring its properties, such as branching ratios, that are difficult to 
access at a hadron collider, in detail. 

Complementary to this discovery mission, the LC has to perform precision 
measurements of the properties of the top quark and of electroweak gauge 
bosons. Whatever the mechanism of mass generation for the fermions will 
turn out to be, the top quark couples most strongly of all known fermions 
to the symmetry breaking sector. In fact, in the minimal SM, the Yukawa 
coupling of the top quark to the Higgs particle is of the order one. Thus 
the couplings of the top quark have to be measured as precisely as possible, 
because they offer the best chance of learning something about the interaction 
of the symmetry breaking sector with the fermionic sector. 

Finally, the LC is in a unique position for precision measurements of 
the properties of the electroweak gauge bosons. LEP2 has firmly estab- 
lished the trilinear W + W~Z° interaction of gauge bosons. The shape of the 
W A+ V4 /_ -production cross section at the threshold as shown in figure |1.3| is as 



convincing as direct fits of the coupling parameters in figure [L5 



Due to limited statistics and energy, LEP2 will not be able to measure 
the W + W~Z° couplings at a level where deviations from the minimal SM 
can reasonably be expected [TJ|, (see chapter |2|). In a strongly interact- 



ing model, with no observed light resonances, deviations from the couplings 
predicted by low energy symmetries are on the order of (see sections |2.7| 
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Figure 1.5: Combined LEP2 and Tevatron limits on Triple Gauge Cou- 
plings at the time of the MORIOND 1999 conference (March 1999) ffl. The 
189 GeV data gives a dramatic improvement over figure (note the differ- 
ent scale!) and vanishing TGC are now definitely excluded. 



and |A.3.5|) 

1 



16tt 2 



6.3 ■ 1(T 3 (1.2) 



and in a weakly interacting model, this factor is further reduced by cou- 
pling constants. Unless Quantum Field Theory (QFT) breaks downF] just 
above the LEP2 energy, the LEP2 experiments are not able to establish 
any deviation from the SM |p3| . The LEP2 measurements are nevertheless 



important, but we should be disappointed by the "failure" to see any de- 
viation from the SM. The improvements in the determination of the mass 
of the iy ± -bosons will be more significant, because M\y is not constrained 
by a low energy symmetry. An improved measurement probes the custodial 
symmetry, if combined with precision calculations and measurements of Mz 
and sin 2 6 W . 

The situation will change qualitatively with the commissioning of the LC. 
The LC will be able to detect deviations at the level of ( |1.2| ) and will be the 
only machine to do so JE], @, [|. Consequently, precision measurements of the 
properties of electroweak gauge bosons are the final piece of a no-lose theorem 
for a high luminosity (C > 10 34 cm _2 sec _1 ), high energy (a/s ph 1 TeV) LC: 

Either an elementary Higgs particle and possible supersymmetry 
is found at the LC as a signal of a weakly interacting symmetry 



1 Recently, the idea of solving the hierarchy problem by introducing large extra dimen- 
sions at the TeV-scale |^0| has attracted a lot of attention. Such a scenario allows many 
spectacular signatures that avoid the bounds from Renormalization Group (RG) argu- 
ments in four-dimensional QFT p2[ and will not be discussed here. 
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breaking sector or the couplings of electroweak gauge bosons will 
deviate from the minimal SM predictions at a measurable level in 
order to accommodate a strongly interacting symmetry breaking 
sector. 

In order to cover the second part of this no-lose theorem, the craft of precision 
electroweak gauge boson physics has to be mastered. 



1.4 Roadmap 

Antigonus: Thou art perfect then, 
our ship hath touch' d upon 
The deserts of Bohemia? 
Mariner: Ay, my lord, . . . 
[William Shakespeare: The Winter's Tale, III. 3} 



Chapter |2| sketches the phenomenological bottom-up approach to the field 
theoretical description of particle physics experiments and applies it to the SM 
of electroweak interactions. Most of this material can be found as part of 



systematic expositions in modern textbooks |24], , but is included any- 

way in a condensed and slightly provocative form in order to put the other 
chapters in perspective. Chapter || attempts to give a concise description 
that separates the features of the SM of particle physics that are firmly es- 
tablished by phenomenology and the general principles of QFT, on one side, 
from model dependent assumptions that have not been tested so far, on the 
other side. 

Chapter ^| in particular assumes familiarity with the Renormalization 
Group (RG) for QFT in the Wilsonian approach P7[] , roughly on the level 
of p6| . Therefore, appendix [S] reproduces a short and deliberately elemen- 
tary introduction to the Wilson RG for QFT, based on the first half of lec- 
tures presented at the German Autumn School for High Energy Physics, 
Maria Laach, September 9-19, 1997. The physical intuition provided by the 
Wilson RG is crucial for understanding why the SM is so successful at LEP1 
and LEP2 and why we can nevertheless expect a lot of exciting physics at 
a LC. 

Chapters |3| and ^ dive deeper into the more technical parts of the phe- 
nomenologists' toolbox. Chapter [3] discusses specific issues arising in the cal- 
culation of matrix elements and differential cross sections for gauge bosons 
in Perturbation Theory (PT). Chapter [| describes methods for phase space 
integration and event generation. These chapters combine published original 
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contributions with results that appear to be known only to the practitioners 
of the field as "tricks of the trade." 

Finally, chapter ^ discusses some aspects of the forthcoming physics of 
electroweak gauge bosons at a LC. 
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—2— 
The Standard Model as 
Effective Quantum Field Theory 



And as the smart ship grew 
In stature, grace, and hue, 
In shadowy silent distance grew the Iceberg too. 
[Thomas Hardy: The Convergence of the Twain] 

This chapter explores some consequences of the general principles of QFT for 
the phenomenology of elementary particles. This material is hardly original 
(see the texts [Q |29|), but is included in a condensed and intentionally 

provocative form in order to set the stage for the following chapters. The 
purpose of this exercise is to allow a separation of kinematics and symmetry 
on one side, from dynamics on the other side. The result of the application 
of this formalism to the SM is Janus-faced: many predictions are very solid, 
because they depend on kinematics and symmetry only, but, by the same 
token, the independence from dynamical details reveals how little we still 
know about the underlying dynamics of the SM. This view on the virtues 
of the SM is purposefully pessimistic. It demonstrates that, despite the 
tremendous achievements of the past, there is still a lot of exciting physics 
waiting to be discovered at the high energy frontier. 

Perturbative renormalizability has often been presented as the main guid- 
ing principle behind the construction of the SM. But when we acknowledge 
our ignorance about the "true" short distance physics that is out of the reach 
of the currently operative, as well as the currently planned, colliders, we can 
not adopt it as a guiding principle anymore. After all, QFT might just be 
cut off at some string scale. However, as stressed in section |A.2.5| , among all 
equivalent theories that describe Nature, there is always one perturbatively 
renormalizable QFT, that contains only relevant and marginal interactions 
at the highest energy scale. Since perturbatively renormalizable QFTs are 
a technically more convenient starting point, it is often more economical to 
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Figure 2.1: Scattering amplitude. 



use them as the initial condition. Nevertheless, irrelevant interactions will 
be observed at low energies and one has to explain them as the result of 



matching conditions at intermediate thresholds as described in section |A.3.3 
in order to show the consistency of the description. 

A comprehensive and systematical modern treatment of all the theoretical 
issues is provided by Weinberg's text |25 |. A more pedagogical treatment 



from a similarly modern perspective is given in p6| , while more on the phe- 
nomenology can be found in [BU| . 



2. 1 Currents 

current: in general circulation or use; . . . 
[Concise Oxford Dictionary] 

The asymptotic single particle states in a scattering experiment are character- 
ized by quantum numbers belonging to the Poincare group (energy E = P , 
momentum -Pi,2,3, and angular momentum L 3 , L 2 ) as well as by internal quan- 
tum numbers corresponding to conserved charges Q n : 

Pp, \p, l,h,<h,--- > Qn) = P» \p, I, h, qi, ■ ■ ■ , q n ) (2.1a) 

L 2 \p, /, l 3 , q 1 , . . . , q n ) = 1(1 + 1) \p, I, l 3 , q u ... , q n ) (2.1b) 

£3 \p, l,h,qi,--- , q n ) = h \p, I, h, ?i, • • ■ , q n ) (2.1c) 

Qn \p, l,h,Qx,--- , q n ) = q n \p, I, h, qi, ■ ■ ■ , q n ) ■ (2. Id) 

For this to make sense, the Q n have to commute with all generators of the 
Poincare groupQ 

[P M ,Q„]=0 (2.2a) 



*We do not include supersymmetry charges among the Q ri 
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[L^,Q n ]=0 (2.2b) 

and in particular 

[H, Q n }=0. (2.2c) 
In general, the Q n will represent a compact Lie algebra 

[Qn 1 ,g n ,]=^^ in2n3 Qn 3 (2-3) 



"3 



that factorizes into simple non-abelian and abelian subalgebras. The quan- 
tum numbers ( |2.1| ) define the asymptotic states in scattering experiments 
completely, if we employ the cluster decomposition principle for asymptotic 
multi particle states. However, the operators appearing in ( |2 . 1| ) are not suf- 
ficient for a Lorentz invariant description of the interaction of the states 
corresponding to tensor products of asymptotic single particle states. 

The creation and annihilation operators in the Fock space built from the 
states in fl2.1| ) are non-local, i. e. their Fourier transforms neither commute 
nor anti-commute at space like distances. However, the only known way (see 



24] for a discussion) of constructing a relativistic (i.e. Lorentz invariant) 
scattering matrix uses a local interaction, i. e. a Lagrangian density that 
is a Lorentz invariant polynomial of local operators that commute or anti- 
commute at space like distances 

[<f>(x), 0V)]± = for ( x - x 'f < ( 2 - 4 ) 

and their derivatives. Therefore, the particle creation and the anti-particle 
annihilation operators have to be combined into local field operators that 
satisfy ( |2.4j ). As a result, it is impossible to have interactions that conserve 
particle number and the correct description of relativistic scattering will in- 
volve an infinite number of degrees of freedom. 

The most profound physical consequence of the mathematical result that 
there are an infinite number of degrees of freedom is the phenomenon of Spon- 
taneous Symmetry Breaking (SSB). For a finite number of degrees of freedom, 
Wigner has shown long ago that all symmetries of the interaction have to 
be symmetries of the ground state and the symmetry has to be represented 
linearly. However, Wigner's theorem breaks down for an infinite number of 
degrees of freedom and symmetries can be broken spontaneously, guarantee- 
ing the existence of massless scalar particles through Goldstone's theorem. 
This allows non-linear realizations of these symmetries |HT[]. The Goldstone 
bosons are protected from obtaining large masses through radiative correc- 
tions, unlike regular scalars. On the other hand, non-linearly realized gauge 
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symmetries provide a consistent description of massive spin-1 particles that 
does not suffer from bad high-energy behavior (see section |2.6| for a detailed 
discussion). 

The charges Q n can be constructed from local quantities as the integral 
of a charge density j®(x) on a space-like hypersurface x = t 

Q n (t)= J d 3 xf n (x), (2.5) 

xo=t 

where j®(x) is the time component of a conserved current j%(x) 

d^(x) = 0. (2.6) 

If the matrix elements of the j%(x) fall off sufficiently rapidly at large dis- 
tances, the charge Q n is conserved 



dQ n 



dt (t)= J d 3 xd f n (x)= yd 3 xVJ n (x) = 0. (2.7) 

xo=t xo=t 

These conserved currents are particularly useful in the phenomenology of 
interacting field theories, because they are not renormalized. For the non 
abelian factors of the Lie algebra generated by the charges Q n , this is trivial, 
because any renormalization — ► Zj^ would induce a renormalization Q — > 
ZQ and violate the commutation relations ( |2.3| ). For the abelian factors, we 
have to make the further assumption that there are operators ip that form a 
non trivial representation 

[Q n , ip] = q n ^ip . (2.8) 

In this relation, any renormalization of the if) will cancel and we find again a 
non renormalization theorem for the current. 

Matrix elements of the conserved currents in asymptotic states are con- 
strained by symmetry, independent of dynamical assumptions. For example, 
the matrix element of a current between two scalar single particle states with 
identical mass (p 2 = p' 2 ) must have the formF] 

(P f , q' n \jnA0)\p, qn) = (pi + Pu)q n F((p> - pf)6 q > ngn , F(0) = 1 (2.9) 

from Lorentz symmetry and current conservation alone, where the normal- 
ization of the form factor at zero momentum transfer F(0) = 1 is fixed by 



2 The eigenstates are orthogonal for hermitian charges Q: (q'\Q\q) 
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g„,2po(27r) 3 5 3 (p / - p) = q n (p', q n \p, q n ) = (p', q n \Q n \p, q n ) 

= J d 3 x (p',q n \j n ,o(x)\p,q n ) = J d 3 xe i£( P- p,) (p',q n \j nfi (0)\p,q n ) 

Xo=t Xo=t 

= (27c) 3 5 3 (p'-p) {p',q n \j n , (0)\p,q n ) , 

i. e. 

q n 2po = (p,gnbn,o(o)b,g n ) . (2.10) 

Similar relations can be derived for matrix elements of particles with spin, 
but in general there is a larger number of independent form factors for which 
symmetry provides less constraints. 

2.2 Weak Interactions 

The low energy manifestations of the interactions described by the Stan- 
dard Model (SM) are the long range electromagnetic interaction and the 
short range weak interaction. In Nature, we observe weak interactions in the 
burning of the solar fuel 

Ap + 2e~ -> Tie + 2u e + 26.73 MeV - E v . (2.11) 

While the released energy is the binding energy of the Tie nucleus due to 
the strong interaction of protons and neutrons, the process ( |2.11|) would 
be impossible without the weak interaction, because the strong interaction 
conserves isospin, which forbids p+e~ — > n+u e transitions. The other natural 
manifestation of weak interactions is in the nuclear /3-decay, for example 

137 Cs -> 137 Ba + e~ + v e . (2.12) 

Decades of particle physics experiments with accelerators have found many 
more examples of weak interactions that can be observed because they violate 
the isospin and parity invariance of strong interactions (see 0] for the most 
exhaustive and up-to-date list). 

2.3 Fermi Theory 

[Pythagoras' pupils] 
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As mentioned in the previous section, there are two defining characteris- 
tics of weak interactions at low energies: parity violation and isospin non- 
conservation. The former is the result of the experimental observation that 
all electrons (and muons and taus) produced in /3-decay are left handed, while 
all positrons (and anti-muons and anti-taus) produced in /3 + -decay are right 
handedQ. Therefore, the weak interaction couples in the leptonic sector to 
(V — A) charged currents of the form 

z>7 M (l -7 5 )e + h.c. . 

Using the r = (ti ± ir 2 )/2 isospin shift operators, such charged currents 
can be combined in the i-components of a conserved isospin vector current 



{v(x) e(x)) 



1-75 



+ OvO) M^)) 



T 



7m" 



u(x) 
e(x) 

1 - 75 



v^x) 
fj,(x) 



+ (u T (x) t(x)) 



75 



u T (x) 
t(x) 



which is written more compactly as 

T 



® 7^ " 



75 



(2.13) 



(2.14) 



where the summation over all isospin doublets is implied. The neutral compo- 
nent j®(x) of the current ( p.!4[) is masked at low energies by much stronger 
electromagnetic interactions of charged leptons and strong interactions of 
hadrons and was observed only much later, after neutrino beams became 
available. 

Making the educated guess that the hadronic part is described by a similar 
isospin current, the simplest short range interaction imaginable couples these 
currents 



9 



X 



(2.15) 



with g a suitable coupling constant and A a suitable scale. According to 
section |A.2.5| , the interaction ( |2.15| ) is irrelevant and expected to be very 
weak at low energies (compared to A). 

3 This observation summarizes many complicated experiments and ingenious theoretical 
interpretations. 
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The interaction ( |2.15| ) has been tested quantitatively (including radiative 
corrections, see e. g. f30j ) in the leptonic sector, where the matrix elements of 
the products of isospin currents can be calculated without further assump- 
tions. From the comparisons with experimental results, lepton universality, 
i. e. that each doublet of leptons contributes identically to the current as 
in (|2.13|) , is firmly established. 



For the hadronic part of the current, no ab-initio calculations are possible 
yet, due to our limited understanding of the non-perturbative dynamics of 
the strong interactions. In semi- leptonic processes like B + — > D°/i + z/ /J , the 
matrix element of the interaction can be factorized in terms of a leptonic and 
a hadronic current 

( f ,+u fl D0\j+(x)r^(x)\B+) = (fi + Vlt \j+(x)\0){DO\j-^(x)\B + ) . (2.16) 

Model independent symmetry arguments for the conserved vector current and 
partially conserved axial current similar to (|2.9|) can be used to express (|2.16|) 
in terms of a single unknown form factor. For heavy hadrons, the form 
factor has a known normalization a zero momentum transfer, up to small 
corrections fffi| . This provides non-trivial cross checks for such processes and 
the interaction ( |2.15[) has passed these tests. Similar symmetry arguments 
have less predictive power for light baryons and vector mesons, because there 
are more independent form factors, but some non-trivial checks can still be 
performed. 

Compared to this, purely hadronic weak processes remain poorly un- 
derstood, because the matrix elements can not be factorized in terms of 
hadronic currents and ab-initio calculations in terms of quarks are not yet 
possible. However, this reflects more on our still poor understanding of non- 
perturbative QFT than on the accuracy of the description of weak interac- 
tions with the Fermi interaction (|2.15|) . 



2.4 Flavor Symmetries 

What are the hadronic currents that the weak interaction couples to? A 
comprehensive study of the transitions among hadrons induced by the weak 
interaction currents 

Jameson = E ^FfX^k (2.17a) 

./;;.„„,„:, >>,77,r /( vi>, (2.17b) 
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SU(6) L ® SU(6) R ® U(V 
m t ,m b ,rrt7 



SU(3) L ® SU(3) R ® U(V 




INC: y,Z 



SU(3)ul® SU(3) DL 
®SU(3) U R®SU(3) DR ®U(1 



3 



SU(2' 



DL 



SU(2) 



DR 



Ufl 




SU(3) L ® SU(3) UR ® SU(3) DR <g> U(1 



Figure 2.2: Chiral symmetry breaking patterns of hadrons by weak interac- 
tions and quark masses. On the left hand side, the symmetry is broken by 
an as yet unknown mechanism that creates heavy quark masses. In the right 
hand side, the embedding of the weak interaction gauge group in the global 
chiral symmetry groups results in quite different symmetry breaking patterns. 



together with the hadron masses M~ 



J 0, meson 



v 0,baryon 



E \l ( w f - \m 2 ^ 



(2.18a) 
(2.18b) 



reveals intriguing symmetry patterns for the meson fields and baryon 



fields tyj, which can be summarized by the succession of global 
gauged) SU(iV) flavor symmetry groups in figure 



i.e. un- 



The strong interac- 



4 Note that the SU(6)i i ii symmetry groups in figure 2^2 contain some revisionistic his- 
toriography, because no hadrons with top flavor are observed in nature. Furthermore, the 
Fermi interaction is not applicable at energies as high as the top threshold. Therefore, 
a strictly phenomenological analysis would use SU(5)l,.r instead. Nevertheless, for the 
purpose of studying the symmetry properties, it is useful to pretend temporarily that the 
top is lighter than it actually is. 
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tions appear to be completely flavor blind and hadrons can be classified in 
representations of a global SU(6)x (g> SU(6)# ® U(l) symmetry group^. 

The classification of the hadronic currents and their interactions accord- 
ing to the symmetries described in figure |2.2j is complicated by the fact 
that the observed hadrons do not live in the fundamental representation N 
of SU(iV). Instead they live in the irreducible components of product rep- 
resentations that carry no overall charge of the color SU(3), responsible for 
Quantum Chromodynamics (QCD), the strong interaction in the quark pic- 
ture. These representations of SU(iV) are N <g> N = 1 © (iV 2 — 1) for the 
mesons and N <8> N ® iV for the baryons. However, the leptons do form a 

5 The six different flavors will later correspond to quarks, coming with left handed and 
right handed chiralities. Note that the SU(6)'s must not be confused with the nonrela- 
tivistic SU(6) symmetry containing the SU(3) of the light hadrons and the SU(2) of spin 
angular momentum. 



Table 2.1: The Standard Model (SM) of elementary particle physics. We 
have color (C) singlets (1) and triplets (3), weak isospin (T) singlets and 
doublets (2 ) and various hypercharge (Y ) assignments. 
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Ve,R (?) v^r (?) 






(C,T)y 


Q = T 3 


"t,R (?) 


(M)o 







(l,l)-2 




(":;') 


(1,2)-! 


(I 









tR 


(3, 


1)4/3 


b R 


(3, 


l)-2/3 


;;) 


(3, 


2)l/3 






9 








(1, 





Y 
~2 




18 



fundamental representation of the symmetries in figure |2.2| , as shown in the 
upper left part of table Furthermore, there is independent experimen- 
tal evidence, in particular from deep-inelastic lepton hadron scattering, for 
quarks as constituents of hadrons, which live in a fundamental representation 
(see table |2.1|) . In particular, the properties of hadrons with heavy flavors 
are well described by heavy quarks surrounded by a SU(3)l ® SU(3)# flavor 
symmetric "cloud" of light quarks and gluons The product representa- 
tions for the observed hadron spectra are therefore understood as the bound 
states of quarks. 

The SU(6)l © SU(6)# symmetry is badly broken by the heavy flavor 
masses m t , m&, and m c , which leaves the SU(3)l©SU(3)r of the eightfold way 
of the three light flavors P3|| . This is already a useful symmetry that can be 
used to derive quantitative results, provided that the breaking of the flavor- 
SU(3) to the isospin-SU(2) due to the finite strange quark mass is taken into 
account properly, together with the spontaneous chiral symmetry breaking 
SU(3)l © SU(3)i? — > SU(3)l + ^ caused by QCD. The product representations 
are the familiar singlet plus octet for the light mesons 3 © 3 = 1 © 8 and the 
singlet, two octets and a decuplet for the light baryons 3©3©3 = 1©8©8©10. 

On the other hand, the SU(6)l © SU(6)# is also broken by differing 
charges for weakly interacting neutral current interactions (i. e. Z°-exchange 
and QED) for up- type quarks and down- type quarks. The remaining SU(3)[/x$ 
SU(3)f/i? corresponds to the flavors u, c, and t of the up-type quarks and 
the SU(3)dl©SU(3) dr corresponds to the flavors d, s, and b of the down-type 
quarks. This symmetry in broken further by the weakly interacting charged 
current interactions to a SU(3)l for the left handed part that interacts and 
a SU(3)(7/j©SU(3) dr for the non-interacting right handed parts. Again, these 
symmetries are badly broken by heavy flavor masses, but the SU(3) l©SU(3)# 
and the SU(3)[/l © SU(3)dl © SU(3)f7# © SU(3)d,r have a common sub- 
group SU(2) DL © SXJ(2) DR that is a good symmetry and has interesting phe- 
nomenological consequences ||3U] . 

An additional complication is provided by the fact that the SU(3)fy£ © 
SU(3)[/i?, the SU(3)dl©SU(3)dr, and the isospin SU(2) of the weak interac- 
tions twisted by the CKM mixing matrix. Without the intuition provided by 
quarks, this low energy symmetry structure of the weak interactions would 
be very hard to understand. 

The SM provides a complete phenomenological description of the cur- 
rents described in this section. However, of all the symmetry breakings in 
figure |2.2j , it only explains the two steps on the right, which are due to neu- 
tral currents and charged currents. The other symmetry breakings, as well 
as the CKM mixing have to be put in by hand, introducing most of the free 
parameters in the SM. 
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The strong interaction physics that is responsible for the relation between 
the fundamental representation and the product representations is interesting 
in itself, but not of fundamental importance for the weak interaction physics 
under consideration. The significant observation of the present section is that 
the leptonic and semi-leptonic low energy weak interactions can indeed be 
described by the Fermi interaction (|2.15 ) for observable hadronic and leptonic 
currents. The axial part of the short range neutral current weak interaction 
has also been observed more recently in low energy experiments through their 
parity violating effects. Advances in experimental atomic physics have made 
atomic parity violation a viable tool for precision physics |5T] . 

The emerging picture of the weak interactions at low energy is that of 
interacting isospin SU(2) currents, where the charged components act on the 
left handed fields and the neutral component couples differently to the two 
members of the isospin doublet particles. The symmetry is incomplete, as 
long as the charged current and neutral current pieces of the interaction ap- 
pear to be unrelated. Below in section we will see that the corresponding 



gauge bosons are indeed related by a custodial SU(2) C symmetry [43 



2. 5 Higher Energy Scattering 



Having established the Fermi interaction ( [2.15 ) for low energy weak interac 



tions, we must study if and when this description will break down at higher 
energies, either phenomenologically or from theoretical inconsistencies. From 
power counting, the interaction is irrelevant and the RG flow in the ultravio- 



let will eventually leave the perturbative domain (see section |A.2.4|) . But in- 
stead of studying radiative corrections, we can also study a tree level process 
like ev scattering. The differential cross section is found to be quadratically 
rising with energy 



On the other hand, we can project the scattering amplitude on the S'-wave 
component and verify the unitarity of the partial wave 

\Mj =0 \<1. (2.20) 

This gives an upper bound on differential cross section 

dcr 1 , 

(2 ' 21) 
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Figure 2.3: Differential cross section and S-wave tree-level unitarity bound 
for e~v e — > e~v e . 



Both the differential cross section and the unitarity bound are shown in 
figure [2.3| , where the unitarity bound is turned on smoothly, because we have 
to allow for higher order corrections close to the bound. In any case, it is 
obvious from figure |2.3| , that the Fermi interaction (|2.15|) has to be replaced 
before 1 TeV. 

The solution to this problem is provided by softening the interaction at 
high energies, e. g. 



1 

A 2 



-1 



A 2 



(2.22) 



2 Thp 

CM- 1Ile 



for some combination p 2 of kinematical invariants that scales as E t 
particular example in ( |2.22 ) is just one of many possibilities that reduce to 
the Fermi interaction at low energy. However, it corresponds to replacing the 
local interaction Q2.15 ) by the exchange of vector bosons 




32tt 2 E CM 



i a , 2M w 

1 - cos 9 + 



(2.23) 



and can therefore be described again by a local interaction. Unlike an arbi- 
trary form factor, a local interaction produces a Lorentz invariant scattering 
matrix (see section 12. II). The formal Lorentz invariance of an ad- hoc form 



factor in a non-local interaction does not suffice for Lorentz invariance of 
the scattering matrix for all matrix elements if the form factor is taken into 
account consistently EM. Space-like correlations will ruin Lorentz invariance. 
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Incidentally, the interaction that produces the vector boson exchange to 
replace the irrelevant current-current interaction, 



L{x) = Y. A »j a " 



x 



(2.24) 



is marginal (see |A.2.5| ) and the scale in the irrelevant interaction is replaced 
by the mass of the new particle. This fits of course nicely within the EFT 
paradigm, where a low energy EFT breaks down at the scale where new 
particles and interactions have to be introduced. 



2. 6 Intermediate Vector Bosons 

The introduction of intermediate vector bosons ( |2.23| ) is not without prob- 
lems, because unitarity requires that all particles that are exchanged must 
also appear in external states. Massive vector bosons have three degrees of 
freedom and massless vector bosons (photons) have only two degrees of free- 
dom. However, a naive counting of the components of a Lorentz vector finds 
four degrees of freedom. 

One approach to a consistent theory would construct the vector bosons 
as narrow spin one resonances of scalars or spin- 1/2 fermions. In such a 
theory, unitarity could be proven in a Hilbert space of asymptotic states built 
from scalars and spin- 1/2 fermions, avoiding all problems with unphysical 
degrees of freedom. However, constituents that could form resonances with 
the correct quantum numbers are not observed and Occam's razor suggests 
to attempt a description in terms of the vectorial degrees of freedom before 
inventing new constituents. 

The current operators have all the right quantum numbers, but they 
have a mass dimension of three and can therefore not be identified with 
intermediate vector bosons with a mass dimension of one. Unfortunately, 
the very fact that conserved currents are not renormalized prevents them 
from acquiring an anomalous dimension and turning into a vector field with 
canonical dimension for sufficiently large coupling. 

For non-interacting massive vector bosons, a simple solution is that the 
equation of motion 

(d 2 + M 2 )A» - d»d v A v = (2.25) 
derived from the Proca Lagrangian with an explicit mass term 

L = - l -F^ + l -M 2 A^, = d^Ay — d v A^ (2.26) 

6 See however [pi for an interesting speculation along this line. 
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eliminates the unphysical degree of freedom automatically, because of 

M 2 d^ = . (2.27) 

The operator equation ( |2.27| ) removes precisely the degree of freedom that 
would correspond to a negative norm state in a naive canonical quantization 
of all four components. 



2.6.1 Power Counting 



If the vector boson A is coupled to a conserved current in (|2.24|) , the operator 
equation ( p. 27 ) remains intact. However, even if the vector boson couples to 
conserved currents only, it is not true in general that contributions from the 
longitudinal part of the propagator 



p 2 - M 2 + ie 



M 2 



(2.28) 



cancel in physical amplitudes. Instead, except for exceptional cases, Q2.27|) has 
to be enforced explicitely in intermediate states. As a consequence, the power 
counting rules ( |A.89| ) used to classify effective Lagrangians are violated by 
the longitudinal polarization state. 



If the currents in ( 2.17 ) are conserved and correspond to an abelian sym- 
metry, i.e. 



[T a ,T b ] = 



[2.29) 



the longitudinal polarization state decouples nevertheless. This is trivial 
from the equations of motion for on-shell currents, but in order to show that 
the contributions from off-shell propagators cancel, one has to sum over all 
Feynman diagrams (see also section |3~2] ). For example, in the process e + e _ — > 
Z°Z°, the diagrams 



"(Ph 




Z°(k) 



[2.30) 



e~(p_) Z°(q) e"(p_ 

contribute equally to the Ward identity 

k^(q)M^(p + ,p„;k,q) = 



Z\q) 



(2.31) 
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In general, ( |2.29| ) is satisfied for the neutral current piece of the weak inter- 
action, because these particular T a are generated from the Pauli matrices 1 
and T3 by linear combination, direct product, and direct sum. 

Due to this decoupling, it is possible to replace the Proca propaga- 
tor (|2.28|) for abelian currents by a Stiickelberg propagator 



p 2 — M 2 + ie 



(^-c-o^fe)' (2 ' 32) 



which is compatible with the power counting rules (|A.89 ). Then the choice £ 



1 is particularly convenient for quick calculations, while the cancellation of £ 
dependent pieces provides useful cross checks in more complicated calcula- 
tions. With (|2.32|) , dA can be eliminated consistently from matrix elements 

(phys.l^A^lphys/) = (2.33) 

as a free field, but it does not necessarily vanish as an operator in the full 
indefinite metric Hilbert space. 

The situation is more involved for charged current interactions in which ( p. 29 
is violated. For example, in the process e + e~ — > W + W~ only one of the di- 
agrams corresponding to ( 2.30|) does not vanish 



e+ W + 

Mj$(p + ,p-',k,q)= (2 - 34) 

e" W~ 
and it does not satisfy the Ward identity by itself 

k^(q)M$(p + ,p^k,q) Kv(p + ){(q)u(p-) . (2.35) 

Since the W ± are charged, there is a second diagram with photon exchange 

e+ W + 

M$(p + ,p_-k,q)= ' ( 2 - 36 ) 

e" W~ 

but this does not cancel ( |2.35| ) for minimal coupling —>■ <9 M — ieA^ of 
the W ± . Therefore, we must not use a Stiickelberg propagator (|2.32|) for 



massive vector bosons coupled to charged currents naively. In an attempt to 
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salvage the power counting, we can take a hint from unbroken non-abelian 
gauge theories (a. k. a. Yang-Mills theories) and investigate another diagram 

e+ W + 

M(2.37) 

arising from a coupling of the Vi^'s to a third gauge boson, the Z°, which 
couples to the r 3 component of the current, in order to close the gauge algebra 
generated by the t ± . Indeed using the non-abelian gauge vertex and putting 
the outgoing Vt^'s on their mass-shell 



- M 2 

k^(q)M^\p + ,p.;k,q) oc —-^v{ P +){{q)u{v-) . (2.38) 



Thus ( |2.35| ) could be canceled with charge assignments that are compatible 
with the gauge algebra, as long as Mw = Mz- However, the sum of ( |2.34| ), 
(|2.36Q and (|2.37|) still couples to unphysical polarization states if the external 



bosons are massive, because it is impossible to cancel the massless photon 
pole this way. Moreover, adding the Yang-Mills interaction to the Proca 
Lagrangian ( |2.26| ) results in equations of motion that violate ( |2.27| ). There- 
fore, a successive plugging of the holes is not likely to succeed and we should 
take a step back and reconsider, how unitarity is implemented in covariantly 
quantized massless Yang-Mills theories. 

In addition to the two physical degrees of freedom there are four unphys- 
ical degrees of freedom in the covariant quantization of gauge theories: the 
scalar and longitudinal polarization states d (l A tt {x) and A L (x) of the gauge 
bosons are accompanied by two Faddeev-Popov ghosts r](x) and fj(x) [|S 



The quartet-mechanism [46j for Becchi-Rouet-Stora (BRS) [^7| invariant 



interactions guarantees that these unphysical degrees of freedom decouple 
from the two physical degrees of freedom. Only in an abelian gauge theory, 
as QED, do the unphysical polarization states and ghost decouple separately. 
Thus the physical Hilbert space in a Yang-Mills theory is not characterized 
by the simple Gupta-Bleuler condition ( |2.33p , but by the cohomology of the 
conserved BRS-charge 

Qbrs(*) = -z I d 3 ftr[ ttq(x) d t)(x) - ign (x) [A (x),T)(x)] 



2 



XQ = t 



+ l -l[r,( x ),r]{x)]dofj{x) j . (2.39) 
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The cohomology arises because the BRS-charge Qbrs projects on zero norm 
states 

||Qbrs|*>|| 2 = <*|Q!rs|*)=0, (2.40) 

due to 

Q ] brs = Qbrs (2.41a) 
Q 2 BRS = 0, (2.41b) 

and such states have to factored out of the physical states characterized by 

Q B R S |phys.) =0, (2.42) 

in order to construct a proper Hilbert space. For massive vector bosons, 
some other degree of freedom has to take the place of the now physical lon- 
gitudinal polarization state Al(x) in the quartet-mechanism. This degree of 
freedom can be provided by the Goldstone bosons of a spontaneous symme- 
try breaking and the quantum numbers will match, if, and only if, the broken 
symmetry is the gauge symmetry of the massive vector bosons. 



2.6.2 Hidden Symmetry 



Spontaneously broken gauge theories provide a consistent description of mas- 
sive vector bosons, in which the dimensions of the fields are indeed given 
by ( [A. 10 ) and ( A. 11 ). There is no argument that they are the only con- 
sistent description. However, such an argument is not necessary, because it 
is possible to use a non-linear field redefinition to rewrite any low energy 
effective interaction of vector bosons as the unitarity gauge version of a non 
linearly realized gauge theory ^8|, ^9], |5(|. In an abelian toy example^ there 
are the correspondences 



ip(x) 



X [x) 



e -im/f x ( x ) 



A^(x 



x 



(2.43) 



X is a matter field and is massive vector field, while is a Goldstone boson 
field and A^ a gauge field of a non-linearly realized U(l) 







<P(x) 


H 







D iuj(x) 



X{x) 



(x) + fuj(x) 



(2.44) 



7 In the non-abelian generalization, the physics is obscured slightly by the need to 
parametrize the factor group manifold G/H . Nevertheless, it is a straightforward appli- 
cation of the formalism set forth in plj] and explicit formulae are given in p9l p0| . 
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Without a specification of the subsidiary conditions for the quantization of 
the vector fields, it appears that (A^, <ft) has one more degree of freedom 
than Vp. However, is understood to be quantized with d^V^ = enforced 
explicitely, while (A^, <ft) is quantized in an indefinite norm Hilbert space 
where <ft and d^A^ conspire with the Faddeev-Popov ghosts to decouple from 
physical amplitudes |^6[ and the same number of degrees of freedom survive. 



Since the non-linear field redefinitions do not change the on-shell scattering 
matrix elements [[5T], the two formulations will yield the same predictions for 
observable quantities. 

The preceding reasoning does not prove that all vector bosons are gauge 
bosons in disguise, but it comes very close, because it shows that for every 
theory with massive vector bosons, there is another theory in which the vec- 
tor bosons are parametrized gauge bosons of a non-linearly realized gauge 
symmetry. These theories are indistinguishable at energy scales below the 
symmetry breaking scale A = 4nf characterizing the non-linear realization. 
Above the symmetry breaking scale, the two theories are of course very dif- 
ferent, because the gauge symmetry will then be realized linearly. 



In a covariant i?^-gauge, a special case of fl2.42j ) is the new subsidiary 
condition 

(phys.|<9^(x) - £m A 0(x)|phys.') =0. (2.45) 



that takes the place of ( |2.33| ). At energies large compared to the boson mass, 
the longitudinal polarization vector e^ L {k) can be approximated by k^/m^- 
Then ( j2.45| ) turns into the celebrated equivalence theorem |4"8| , |53J and the 
scattering amplitudes for longitudinal vector bosons can be calculated more 
efficiently by calculating the corresponding amplitudes for Goldstone bosons. 

2.6.3 Tree Level Unitarity 

A complementary analysis arrives at the same conclusion. A systematic 
study of the tree level unitarity of four-point functions and five-point func- 
tions calculated with arbitrary interactions reveals that only spontaneously 
broken gauge theories result in amplitudes that satisfy tree level unitarity at 
high energy j|S) (see also []53J for a less systematical but more pedagogical 



account). As could be expected from the study of the Ward identities, the 
production of longitudinal vector bosons 



v we 

X- 

wt 



^ = ^L. E 2 sin 2 d (246) 
dQ 32tt 2 cm v ; 
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Figure 2.4: Tree-level unitarity bound for vv — » W^W L . 



is responsible for this result. P-wave unitarity (|A4j=i| < 1) demands 



da 18 sin 2 9 



(2.47) 



which is displayed in figure |2.4| . Unitarity is maintained up to consider- 
ably higher scales than in figure [2.3| , but will eventually be violated at a 
few TeV. Continuing this heuristic "patching up" will unambiguously f4"8]| 
lead to a spontaneously broken gauge theory, identical to the one proposed 
in sections |2.6.1| and |2.6.2| . 



2. 7 The Minimal Standard Model 

Combining the low energy information on the flavor symmetries with the 
observation of the top quark and the description of weak interactions by 
gauge bosons of a non-linearly realized SU(2)l ® U(l)y/U(l)g and the un- 
broken SU(3)c* of QCD, results in the particles of the minimal SM lined 
up with their quantum number assignments in table |2.1| . Instead of realiz- 
ing SU(2) £ ® U(l)y/U(l)g non-linearly, it is also possible to introduce an 
elementary Higgs particle to break the electroweak symmetry. 

Both options provide a consistent theoretical framework for calculations 
and are not ruled out by experiment. However, a linear realization with 
a light Higgs near 100 GeV appears to be favored by current experimental 
results (see figure |i~2|) . 

As discussed at length in section |2.(j| , the low energy non-abelian charged 
currents require that among the equivalent effective theories describing the 
vector bosons, there is one in which the bosons are the non-abelian gauge 
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bosons of a broken symmetry. Therefore the non-abelian couplings shown 
in figure |1.5| are unavoidable and deviations from the gauge theory predic- 
tions have to be consistent with the power counting rules of EFT, i. e. Naive 
Dimensional Analysis (NDA), as described in section [A. 3. 5 . 



NDA is based on the hierarchy of two scales, here the Fermi scale v ~ 
246 GeV and Aewsb = 47ry ~ 3.1 TeV, with no intermediate scale. Therefore 
its predictions can be invalidated by the observation of a new physical thresh- 
old between v and A EWSB . However, such a threshold would be observed and 
NDA is a valid tool for studying electroweak gauge bosons on the TeV scale, 
if no new particles will have been discovered until then. 

Unless a new threshold is discovered or something very spectacular hap- 
pens that would undermine most our present understanding of QFT, the 
deviations from the minimal SM must be on the order of 

' 6.3 • 1(T 3 . (2.48) 
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From the predictions for the sensitivity at LEP2 on gauge boson couplings |23| . 
we see that fl2.48|) will not be reached. Thus LEP2 will not be able to distin- 
guish among models for EWSB, based on measurements of the gauge boson 
couplings alone. The direct search for a very light Higgs is more promis- 
ing. This situation changes will change dramatically at the LC, due to the 
increased energy and luminosity (see section |5.1|). 



2.8 Precision Tests 

The observation that low energy physics can be described by an effective 
field theory, does not mean that precision experiments can not access at least 



some aspects of the physics at higher scales through loop corrections |56|. If 
the low energy EFT has less symmetry than the EFT relevant at a higher 
energy scale, the parameters of the low energy EFT can be tested for the 
remnants of the high energy symmetry. The most celebrated example of this 
phenomenon at LEP1 are the radiative corrections to the p-parameter and 
the evidence for a custodial SU(2) C symmetry [Q of the EWSB sector (see 
also section IA.3.11) . 



At low energies, in the current-current couplings (|2.15|) , the relative strengths 
of the weak and neutral current couplings is a free parameter. After the intro- 
duction of intermediate vector bosons, the two dimensionful parameters are 
interpreted as the ratio of dimensionless couplings and vector boson masses. 
Above the production threshold of the vector bosons, their masses can be 
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measured independently from the strength of weak interactions. From the 
observed values, they appear to be related because the p-parameter 



( 2 49) 

cos 2 ^Mf { ' 



is very close to unity, with the renormalization scheme dependent^] deviations 
in the SM on the order of 1%. In (|2.49| ), the weak mixing angle 6 W is defined 



e = g sin 6 W (2.50) 

by the ratio of the left handed charged current couplings and the electromag- 
netic coupling. 

Of course, p — 1 could be just a numerical accident, unless it can be 
explained by a symmetry. In the minimal SM, p = 1 is explained by the 
fact that all three components of the isospin-triplet charged gauge boson 
receive the same mass from the coupling to the EWSB sector, before the 
neutral component mixes with the hypercharge gauge boson. In the linear 
realization of the EWSB, the global SU(2) C <g> SU(2) L symmetry is equivalent 
to the SO(4) of the four components of the complex Higgs doublet. In a 
Higgs-less realization, the SU(2) C <g> SU(2) L <g> U(l) y /SU(2) c+L ® U(1) Q has 
to be realized non-linearly |TT| . Apparently, whatever the detailed dynamics 
of EWSB is, it respects the custodial symmetry relating W + , W~ , and W 3 = 
cos 9 W Z° + sin 6 W A. 

The most important deviations from p = 1 in the minimal SM depend 
on m t (see section |A.3.1|) . Therefore, the joint efforts of LEP1, LEP2, SLC 



and Tevatron in the measurement of Mz, Mw, sin^, and mt have provided 
strict constraints on the symmetry of the EWSB sector. The TeV-scale 
physics probed in this way exceeds the direct physics reach of the contributing 
colliders (see figure |1.1| ) by an order of magnitude. 



8 There are mass dependent renormalization schemes that actually enforce p — 1 at one 
scale. The non-trivial observation is then that p does not "run away" from this value. 
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— 3— 
Matrix Elements 



The quest for a theory of flavor demands precise calculations of high energy 
scattering processes in the framework of the SM and its extensions. At the 
Tevatron, the Large Hadron Collider (LHC), and at the LC, final states with 
many detected particles and with tagged flavor will be the primary handle 
for testing theories of flavor. 



3.1 Three Roads from Actions to Matrix Elements 

3.1.1 Manual Calculation 

The traditional textbook approach to deriving amplitudes from actions are 
manual calculations, which can be aided by computer algebra tools. The 
time-honored method of calculating the squared amplitudes directly using 
trace techniques is no longer practical for today's multi particle final states 



and has generally been replaced by helicity amplitude methods (see e. g. [57 



Manual calculations have the disadvantage of consuming a lot of valuable 
physicist's time, but can provide insights that are hidden by the other ap- 
proaches discussed below. Regarding basic processes, where extremely fast 
and accurate predictions are required for numerical fits, manual calculations 
can provide efficient formulae that are still unrivaled. 



3.1.2 Computer Aided Calculation 

An increasingly popular technique is to use a well tested library of basic helic- 
ity amplitudes for the building blocks of Feynman diagrams and to construct 
the complete amplitude directly in the program in the form of function calls. 
A possible disadvantage is that the differential cross section is nowhere avail- 
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able as a formula, but the value of such a formula is limited anyway, since 
they can hardly be printed on a single page anymore. 



3.1.3 Automatic Calculation 

Fully automatic calculations are a further step in the same direction. The 
Feynman rules (or equivalent prescriptions) are no longer applied manually, 
but encoded algorithmically. This method will become more and more impor- 
tant in the future, but more work is needed for the automated construction 
of efficient unweighted event generators (see section 4.1.4Q and for the auto- 
mated factorization of collinear singularities in radiative corrections (see 



3.2 Forests and Groves 

Calculations of cross section with many-particle final states remain challeng- 
ing despite all technical advances and it is of crucial importance to be able 
to concentrate on the important parts of the scattering amplitude for the 
phenomena under consideration. 

In gauge theories, however, it is impossible to simply select a few signal di- 
agrams and to ignore irreducible backgrounds. The same subtle cancellations 
among the diagrams in a gauge invariant subset that lead to the celebrated 
good high energy behavior of gauge theories such as the SM (see section j2.6|), 



come back to haunt us if we accidentally select a subset of diagrams that is 
not gauge invariant. The results from such a calculation have no predictive 
power, because they depend on unphysical parameters introduced during the 
gauge fixing of the Lagrangian. It must be stressed that not all diagrams in 
a gauge invariant subset have the same pole structure and that a selection 
based on "signal" or "background" will not suffice. 

The subsets of Feynman diagrams selected for any calculation must there- 
fore form a gauge invariant subset, i. e. together they must already satisfy the 
Ward- and Slavnov- Taylor-identities to ensure the cancellation of contribu- 
tions from unphysical degrees of freedom. 

In abelian gauge theories, such as QED, the classification of gauge invari- 
ant subsets is straightforward and can be summarized by the requirement of 
inserting any additional photon into all connected charged propagators, as 



shown in figure |3J]. This situation is similar for gauge theories with simple 
gauge groups, the difference being that the gauge bosons are carrying charge 
themselves. For non-simple gauge groups like the SM, which even includes 
mixing, the classification of gauge invariant subsets is much more involved. 
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Figure 3.1: The gauge invariant subsets in e + e — > fi + jj, 7 are obtained by 
adding the photon in all possible ways to connected lines of charged particles. 



In early calculations, the classification of gauge invariant subsets in the SM 



has been performed in an ad-hoc fashion (see |62], j63fl), which was later super- 



seded by the explicit construction of groves ||64|| , the smallest gauge invariant 
classes of tree Feynman diagrams in gauge theories. This construction is not 
restricted to gauge theories with simple gauge groups. Instead, it is applica- 
ble to gauge groups with any number of factors, which can even be mixed, 
as in the SM. Furthermore, it does not require a summation over complete 
multiplets and can therefore be used in flavor physics when members of weak 
isospin doublets (such as charm or bottom) are detected separately. The 
method constructs the smallest gauge invariant subsets and examples below 
illustrate that they are indeed smaller than those derived from looking at the 
final state alone |J2|, |S3"fl . 

The method will probably also have applications in loop calculations. 
However, the published proofs use properties of tree diagrams and further 
research is required in this area. 

In unbroken gauge theories, the permutation symmetry of external gauge 
quantum numbers can be used to subdivide the scattering amplitude corre- 
sponding to a grove further into gauge invariant sub-amplitudes (see |J5| and 
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references cited therein). In this decomposition, each Feynman diagram con- 
tributes to more than one sub-amplitude. It is not yet known how to perform 
a similar decomposition systematically in the SM, because the entanglement 
of gauge couplings and gauge boson masses complicates the structure of the 
amplitudes. The construction of the groves provides a necessary first step 
towards the solution of this important problem. 

3.2.1 Gauge Cancellations 

Even if the general arguments are well known, the explicit cancellations of 
unphysical contributions remain amazing in explicit perturbative calcula- 
tions in gauge theories. The intricate nature of these cancellations compli- 
cates the development of systematic calculational procedures. Nevertheless, 
phenomenology needs numerically well behaved matrix elements and cross 
sections, which should be as compact as possible. The development of an 
automated procedure for obtaining compact expressions with explicit gauge 
cancellations remains a formidable challenge. The selection of gauge invari- 
ant subsets of Feynman diagrams described in the remainder of this section 
is one necessary step towards this goal. 

As discussed in chapter ^ there is overwhelming phenomenological evi- 
dence that the strong and electroweak interactions are mediated by vector 
particles. A naive description of vector particles would assign to them four 
degrees of freedom or polarization states, of which only two (for massless 
particles) or three (for massive particles) can be physical. The only known 
theories that can consistently decouple the unphysical degrees of freedom for 
interacting fields are gauge theories (for massless particles) or equivalent to 
spontaneously broken gauge theories (for massive particles). 

However, perturbative calculations require an explicit breaking of gauge 
invariance for technical reasons and the cancellation of unphysical contribu- 
tions is not manifest in intermediate stages of calculations. The contribution 
of a particular Feynman diagram to a scattering amplitude depends in the 
gauge fixing procedure and has no physical meaning, in general. 

Among the possible gauge fixing schemes, two complementary approaches 
can be identified: unitarity gauges and renormalizable gauges. A massive 
gauge boson in unitarity gauge has a propagator 

and unitarity of the time evolution can be proved straightforwardly, because 
only physical degrees of freedom propagate and the condition d^V 11 = is 
maintained. Unfortunately, the longitudinal part of the propagator ( PTTJ ) 
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does not fall off with high energy and momentum. This spoils the power 
counting for renormalization and, even worse, violates tree-level unitarity 
at high energies, because the scattering amplitudes for longitudinal vector 
bosons do not fall off rapidly enough to satisfy the optical theorem at high 
energies. 

On the other hand, the family of i?g-gauge fixings results in propagators 
that propagate unphysical degrees of freedom 

.<//»'-<! - 2 It, , J (3-2) 



p 2 — M 2 + %t V M V - £M 2 + it 

but with good high energy behavior and standard power counting even for the 
longitudinal components. The cancellation of the unphysical contributions 
in scattering matrix elements turns out to be a formidable combinatorial 



problem [Be 



The second approach is superior for formal proofs to all orders (see |71[ 



and references therein), because it allows to treat the two complicated prob- 
lems independently. In a first step, the theory is defined using a Bolgoliubov- 
Parasiuk-Hepp-Zimmermann (BPHZ) subtraction procedure f73]| , which vio- 
lates gauge invariance and unitarity, but allows to prove the Quantum Action 
Principle (QAP) J7^|. The QAP states that the classical symmetries of the 
Lagrangian density L are respected by the effective action T, upto a lo- 
cal operator of dimension five (see also J75fl). Subsequently, one can prove 
by an inspection of all operators of dimension five, that the BRS invari- 
ance of the effective action can be restored by finite and local counterterms 
of dimension four, provided that the coefficient of the Adler-Bell-Bardeen- 
Jackiw (ABBJ) anomaly vanishes. Finally, the unitarity of the BRS invariant 



theory is proven with the quartet-mechanism J46 . 

This two-step procedure is transparent because it decouples the combina- 
torics of renormalized perturbative power counting from the combinatorics 
of gauge invariance and BRS invariance. Once the QAP is established, BRS 
invariance is reduced to a problem in finite-dimensional cohomology. On 
the other hand, controlling the high energy behavior of the unitarity gauge 
propagator (|3.1j) and the combinatorics of Ward identities simultaneously is 
prohibitively complicated, even in low orders of the loop expansion. 

However, this decoupling of kinematical structure and gauge structure, 
which makes the abstract argument transparent, is not useful in concrete 
calculations, because the abstract arguments apply only to the sum of all 
diagrams and shed little light on how the gauge cancellations "work". One 
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particular problem is that the Ward identities relate diagrams with different 
pole structure. For example, in qq — ► gg 




(3.3) 



the numerator factors cancel parts of denominators and the Ward identity is 
satisfied only by the sum and not by individual diagrams. 

Therefore, the physical amplitudes are determined by an intricate web of 
kinematical structure and gauge structure, that is very hard to disentangle. 
Nevertheless, the identification of partial sums of Feynman diagrams that are 
gauge invariant by themselves is of great practical importance for at least two 
reasons. Firstly, it is more economical to spent the time improving Monte 
Carlo statistics for the important pieces of the amplitude than to calculate 
the complete amplitude all the time. This requires the identification of the 
smallest gauge invariant part of the amplitude that contains the important 
pieces. Secondly, different parts of the amplitude are characterized by differ- 
ent scales (e. g. s or t) for radiative corrections. The different scales can only 
be used if the parts correspond to separately gauge invariant pieces. 

The practical consequence of these arguments for LEP2 physics is that 
each individual diagram in the e + e~ — > W + W~ amplitude 



(3.4) 



has a bad high energy behavior pj^ and only the inclusion of subtle interfer- 
ences in the sum results in good high energy behaviorQ 

The on-shell pair production (|3.4j) is unfortunately not the observed 
process. This causes two problems: the finite width necessitates Dyson- 
resummation and higher orders violate gauge invariance [[Sl}| . Furthermore, 
the observed final state consists of many observed particles with tagged flavor. 
Therefore there are many tree-level background diagrams that contribute to 
the same process. For this reason, the calculations are complicated and it is 
important to identify the important parts. 




3.2.2 Flavor Selection Rules 

The interesting flavor physics at the LHC and at the LC is different from QCD 
with light quarks because the gauge symmetry is broken spontaneously and 

1 The good high energy behavior is realized separately for the SU(2)l and U(l)y parts 
of the amplitude pOy , but this requires breaking up diagrams. 



36 



e 

\ 


rvvvvvv< 


e 

/ 


e+ 




e+ 



e 


e 








e+ 



Figure 3.2: Gauge invariant subsets of Feynman diagrams in Bhabha scat- 
tering. 



the gauge group is not simple. Furthermore, the factors are mixed according 
to 



Z° 
A 



cos 9 W — sin 6 W 
sin 6 W cos 6 W 



W 3 
B 



(3.5) 



and the combinatorics are consequently more intricate. Finally, the flavor 
of leptons and heavy quarks is tagged and the flavor transformations do not 
commute with the gauge transformations and no summation over complete 
gauge multiplets is possible. For these reasons, the QCD recursion relations 
or string inspired methods |65| are not directly applicable. 

One method of identifying gauge invariant subsets is to identify subsets 
that contribute to a particular final state For example, there are ten 

tree diagrams contributing to e + e~ — > fi~u^ud and therefore the correspond- 
ing subset of the 20 diagrams in e + e~ — > e~u e ud and its complement must 
be both gauge invariant by themselves. The subsets that can be derived in 
this way are in general not minimal, but this example shows that selection 
rules of flavor symmetries that commute with the gauge group appear to be 
a useful tool. 

The simplest example of these flavor selection rules is provided by Bhabha 
scattering: the s-channel and the t-channel diagram in figure |3.2j have to 
be gauge-invariant separately, because both e + e~ — ► /i + yU~ and e + /i - — ► 
e + fi~ are physical processes that must give gauge invariant amplitudes by 
themselves. The first process has only the s-channel diagram and the second 
only the t-channel diagram. As a result, we see that conserved currents of 
(real or fictitious) horizontal (i. e. generation) symmetries can serve as a tool 
to separate gauge invariant classes of Feynman diagrams. 

A less trivial example is provided by the three separately gauge invariant 
sets in ud ud scattering shown in figure O. The separate gauge invar i- 
ance of the gluon exchange diagram is obvious because the QCD generators 
commute with the electroweak generators and the strong coupling can be 
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Figure 3.3: Gauge invariant subsets of Feynman diagrams in ud — * ud scat- 
tering. 



switched on and off without violating gauge invariance. The charged current 
diagram is separately gauge- invariant, because we may assume that the CKM 
mixing matrix is diagonal and then the charged current diagonal is absent in 
us — > us scattering, which is related to ud — > ud by a horizontal symmetry 
that commutes with the gauge group. 

These example in 2 — > 2 scattering are not useful by themselves, unless we 
can handle the combinatorics in the more complicated realistic applications. 

3.2.3 Forests 

In order to develop tools for understanding the combinatorics, we have to 
take a step back from the intricacies of gauge theories and study unflavored 
scalar 3 - and 4 -theory. We assume that no <fi 5 or higher vertices are re- 
quired by phenomenology. In principle, the combinatorial methods describe 
below can be extended to non-renormalizable theories. However, the required 
arguments would have be more involved, without providing additional insight 
into the gauge sector. Since there are no selection rules, the diagrams Si, S 2 , 
and S3 in 

rp _ r.S,l ,S,2 ,5,3 + S,4\ _ 

14 — \t 4 , t 4 , t 4 , t 4 j- — 

(3.6) 
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must have the same coupling strength to ensure crossing invariance. If there 

S 4 

are additional symmetries, as in the case of gauge theories, the coupling of t 4 ' 
will also be fixed relative to tf' 1 , tf' 2 , and tf 3 . 

Henceforth, we will call each exchange t *->■ t' of two members of the 
set T 4 of all tree graphs with four external particles an elementary flip. The 
elementary flips define a trivial relation tot' on T 4 , which is true, if and only 
if t and t' are related by a flip. The relation o is trivial on T 4 because all 
pairs are related by an elementary flip. 

However, the elementary flips in T 4 induce flips in T 5 (the set of all tree 
diagrams with five external particles) if they are applied to an arbitrary four 
particle subdiagram 




(3.7a) 



Obviously, there is more than one element of T 4 embedded in a particular 
t G T 5 and the same diagram is member of other quartets, e. g. 




(3.7b) 



There are 15 five-point tree diagrams in <f) 3 and inspection shows that only 
four other inequivalent diagrams can be reached from any diagram by flips 
of a four-point subdiagram. Thus there must be a non-trivial mathematical 
structure on the set T5 with the relation o. This structure can be visualized 
by the graph in figure |3.4| . Similarly, there are 25 five-point tree diagrams in 
combined 4> 3 and 4> 4 and at most six can be reached from any diagram by a 
flip. 

In the same way, the trivial relation on T 4 has a non-trivial natural ex- 
tension to the set T n of all n-point tree diagrams: tot' is true if and only if t 
and t' are identical up to a single flip of a four-point subdiagram 

t o t' 3t 4 e T 4 , t' 4 E T 4 : U o t' A A t \ U = t' \ £ 4 (3.8) 

Note that this relation is not transitive and therefore not an equivalence 
relation. Instead, this relation allows us to view the elements of T n as the 
vertices of a graph F n , where the edges of the graph are formed by the pairs 
of diagrams related by a single flip 

F n = {{t,t') eT n xT n \tot'} . (3.9) 
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((25)4)3 



2((34)5) 



2(3(45)) 




((23)4)5 

Figure 3.4: The forest F§ of the 15 five-point tree diagrams in unflavored <ft 3 - 
theory. The diagrams are specified by fixing vertex 1 and using parentheses 
to denote the order in which lines are joined at vertices (see the footnote on 
page for examples). 



To avoid confusion, we will refer to graph F n as forest and to its vertices as 
Feynman diagrams. 

The most important property of the forests for our applications is 

Theorem 1 The unflavored forest F n is connected for all n. 

which is easily proved by mathematical induction on n. Indeed, all members 
of T n+ \ can be derived from T n by connecting a new external line to a prop- 
agator (forming a new 3 -vertex) or an existing 3 -vertex (transforming it 
into 4 - vertex). Obviously, two such insertions are related by a finite num- 
ber of flips, which proves the induction step. This theorem shows that it is 
possible to construct all Feynman diagrams by visiting the nodes of F n along 
successive applications of the flips in ( ^6|) . 

Already the simplest non-trivial example of a unflavored ( "vanilla" ) for- 
est, the 15 tree diagrams with five external particles in unflavored 3 -theory, 
as shown in figure |3.4j , displays an intriguing symmetry structure: there are 
120 permutations of the vertices of F§ that leave this forest invariant. The 
automorphism group Aut(i<5) is therefore with 120 elements much larger 
than one would expect. The classification of vanilla forests is probably an in- 
teresting mathematical problem, but without obvious physical applications. 
However, as we will see below, not all flips are created equal and only some 
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flips are required by gauge invariance, leading to a partition of the forest into 
gauge invariant subsets, called groves |64}| . 



Vanilla Trees 



The enumeration of tree diagrams is simplified considerably if we can find a 
unique representative for each topological equivalence class [p2 . One such 
representation is defined by picking one external line as the root and drawing 
the reminder of the diagram as a mathematical tree. The edges at each 
node are sorted according to the index of the external lines beneath each 
node. The sorting lifts the remaining degeneracy and there remains only one 
representative for each topological equivalence class. To be specific, here is 
how the members of T 4 are represented 




(3.10a) 



(3.10b) 



(3.10c) 



(3.10d) 



In this represent at ionP], the flips are equivalent to a recursive tree pattern 
matching 




(3.11a) 



2 Instead of the graphical representation used in ( |3.1C| — 13.12| ), a notation based on 
parenthesis has been used in figure |3.4| . In this less intuitive but more concise notation, 
the four trees in (3.10) are denoted by a(bc), c(ab), b(ac), and abc, respectively. The 
matching in (3.11b) is written (ab)(cd) =4> {a(b(cd)),b((cd)a),c(d(ab)),d((ab)c)}. 
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where each of a, 6, c can be either a complete subtree or a single leaf. In 
these pattern matchings, a subtle special case in which two eligible vertices 
meet, has to be taken care of 



v c d c d a b 

in order not to miss a branch. However, cases like 




(3.11b) 




(3.12) 



c d 



need no special rules because the root vertex will only be expanded and never 
be contracted with another vertex. 

This unique representation of tree diagrams provides an efficient equality 
predicate and the recursive pattern matching provides a function calculating 
the neighbors 



of any tree t in the forest. Using these two ingredients, the construction 
of the forest is a well defined mathematical problem with efficient textbook 
solutions, known as depth first search and breadth first search. Essentially, 
these solutions consist of recursively following the edges to the neighbors 
until all nodes have been visited, keeping track of the visited nodes to avoid 
loops. 

If the forest is connected (from theorem [I], the vanilla forests are known 
to be connected) and a single tree is known, the construction of the forest is 
also an efficient algorithm for generating all Feynman tree diagrams. 



In physics applications, we have to deal with multiple flavors of particles. 
Therefore we introduce flavored forests, where the admissibility of elementary 
flips tot' depends on the four particles involved through the Feynman rules 
for the vertices in t± and t' . 

In order to simplify the combinatorics when counting diagrams for theo- 
ries with more than one flavor, we will below treat all external particles as 
outgoing. The physical amplitudes are obtained later by selecting the incom- 
ing particles in the end. Ward identities, etc. will be proved for the latter 
physical amplitudes, of course. 



<j)(t) = {? e T\t' o 



(3.13) 



3.2.4 Flavored Forests 
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Flavored Trees 



Flavored trees are derived from vanilla trees by adding a label to each edge. 
Feynman rules are then used at each node to identify valid flavor combina- 
tions. 

All flavored trees can be generated from vanilla trees (which correspond 
to "topologies") by a recursive tensor product of elementary alternatives. 
E.g. 

y z° g 



UU VUUUUUU 

The representation described in section |3.2.3j allows to use efficient tree ho- 
momorphisms for the necessary manipulations. In particular, the calculation 
of the algebraic expressions corresponding to flavored trees is a tree homo- 
morphism: 

y 

u(p 1 )^^u(p 2 ) ■ (3.15) 
u u 

If there were no Feynman rules, the flavored forests would be connected just 
like the vanilla forests. However, the Feynman rules typically prohibit most 
of the possible flavor assignments and it turns out that the remaining vertices 
form more than one connected component. 

3.2.5 Groves 

The construction of the groves is based on the observation that the flips in 
gauge theories fall into two different classes: the flavor flips among 

{tf'\tf' 2 ,tf}H > { ■ \ • \X )■ (3-16) 



MIX 



which involve four matter fields that carry gauge charge and possibly addi- 
tional conserved quantum numbers and the gauge flips among 



{tf\^\tf\t G ^}={ V— <, l , &C , ~X } (3.17a) 




43 



and 

{ t r,tr,tf\tr} = < i . iv . >—< . x ;> (3.171)) 




G 8 

which also involve external gauge bosons (the diagram £ 4 ' , is only present 
for scalar matter fields that appear in extensions of the SM: SUSY partners, 
leptoquarks, etc.). In gauge theories with more than one factor, like the SM, 
the gauge flips are extended in the obvious way to include all four-point 
functions with at least one gauge-boson. Commuting gauge group factors 
lead to separate sets, of course. In spontaneously broken gauge theories, the 
Higgs and Goldstone boson fields contribute additional flips, in which they 
are treated like gauge bosons. Ghosts can be ignored at tree level. 

As we have seen for the examples of Bhabha-scattering and ud — > ud 
scattering in section |3.2.2| , the flavor flips ( |3.16|) are special because they can 



be switched off without spoiling gauge invariance by introducing a horizontal 
symmetry that commutes with the gauge group. Such a horizontal symmetry 
is similar to the replicated generations in the SM, but if three generations do 
not suffice, it can also be introduced artificially. This observation suggests 
to introduce two relations: 

t • t' -<=>- t and t! related by a gauge flip (3.18a) 

and 

tot' -<=>• t and t! related by a flavor or gauge flip. (3.18b) 

These two relations define two different graphs with the same set T(E) of all 
Feynman diagrams as vertices: 

F(E) = {(t,f) G T(E) x T(E)\tot'} (3.19a) 

and 

G{E) = {(t,f) G T(E) x T(E)\ft'}. (3.19b) 

As we have seen above in Bhabha-scattering, it is in general not possible to 
connect all pairs of diagrams in G(E) by a series of gauge flips. Thus there 
will be more than one connected component. For brevity, we will continue 
to denote the flavor forest F(E) as the forest of the external state E and we 
will denote the connected components Gi(E) of the gauge forest G(E) as the 
groves of E. Since t • t' => t o t', we have \J. G^E) = G(E) C F(E), i. e. the 
groves are a partition of the gauge forest. 
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Figure 3.5: The three diagrams in qq — > gZ 0/ y are connected by successive 
gauge flips of subdiagrams like (\3.17tQ . If the decays of the Z° were included, 
bremsstrahlung from the decay products would form a separate grove, because 
no gauge flip will pass the photon across the uncharged Z°. 



Theorem 2 The forest F(E) for an external state E consisting of gauge 
and matter fields is connected if the fields in E carry no conserved quantum 
numbers other than the gauge charges. The groves Gi(E) are the minimal 
gauge invariant classes of Feynman diagrams. 

By construction, the theorem is true for the four-point diagrams and we 
can use induction on the number of external matter fields and gauge bosons. 
Since the matter fields are carrying conserved charges, they can only be added 
in pairs. 

If we add an additional external gauge boson to a gauge invariant am- 
plitude, the diagrammatical proof of the Ward and Slavnov- Taylor identities 
in gauge theories requires us to sum over all ways to attach a gauge boson 
to connected gauge charge carrying components of the Feynman diagrams. 
However, the gauge flips are connecting pairs of neighboring insertions and 
can be iterated along gauge charge carrying propagators. Therefore no par- 
tition of the forest F(E) that is finer than the groves Gi(E) can preserve 
gauge invariance. Two examples are shown in figures |3.5| and |3.6| . 

If we add an additional pair of matter fields to a gauge invariant ampli- 
tude, we have to consider two separate cases 




(3.20) 



If the new flavor does not already appear among the other matter fields, the 
only way to attach the pair is through a gauge boson, as in the left diagram 
in the braces in ( |3.20| ). If the new flavor is already present, we can also 
break up a matter field propagator and connect the two parts of the diagram 
with a new gauge propagator. Since it is always possible to introduce a new 
flavor, either physical or fictitious, without breaking gauge invariance, these 
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Figure 3.6: All five diagrams in ud — > cs"f are in the same grove, because 
they are connected by gauge flips passing through the diagram in the center. 
In contrast, in e + e~ — > / u + ^7 (see figure \3.J\ ), the center diagram is missing 
and there are two separate groves. 



cases fork off separately gauge invariant classes every time we add a new 
pair of matter fields. On the other hand, the cases in fl3.20| ) are related by 
a flavor flip. Therefore F(E) remains connected, the Gi(E) are separately 
gauge invariant and the proof is complete. 

Earlier attempts [^, [63| have used physical final states as a criterion for 
identifying gauge invariant subsets. We have already shown that the groves 
are minimal and therefore never form a more coarse partition than the one 
derived from a consideration of the final states alone. Below we shall see 
examples where the groves do indeed provide a strictly finer partition. 

In a practical application, one calculates the groves for the interesting 
combinations of gauge quantum numbers, such as weak isospin and hyper- 
charge in the SM, using an external state where all other quantum numbers 
are equal, since the flavor forest would otherwise be disconnected. The com- 
binatorics is simplified by treating all particles as outgoing. The physical 
amplitude is then obtained by selecting the groves that are compatible with 
the other quantum numbers of the process under consideration. Concrete 
examples are considered in the next section. 

If only a few flavor combinations are interesting, it can be more conve- 
nient in practical applications to generate all diagrams for the external states 
conventionally and to partition the disconnected flavor forest. The resulting 
groves will be the same, of course. 
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3.2.6 Application 



In table 3.1 , we list the groves for all processes with six external massless 
fermions in the SM, with and without single photon bremsstrahlung, with- 
out QCD and CKM mixing. We could also include fermion masses, QCD, 
and CKM mixing within the same formalism, but the table would have to 
be much larger, because additional gluon, Higgs and Goldstone diagrams 
appear, depending on whether the fermions are massive or massless, colored 
or uncolored. In the table, cases with identical SU(2)i ® U(l)y quantum 
numbers are listed only once and cases with different T 3 and Y are listed 
separately only if the vanishing of the electric charge removes diagrams from 
a grove. E. g. e + e~e + e~e + e~ is identical to uuuuuu and not listed separately. 
The familiar non-minimal gauge invariant classes for e + e~ —>■ /1/2/3/4 



B2fl are included in table [D] as special cases. The LEP2 Wiy-classes CC09, 
CC10, and CC11 are immediately obvious. As a not quite so obvious exam- 
ple, the process e + e~ — > fi + h~t + t~ has the same SU(2)x quantum numbers 



Table 3.1: The groves for all processes with six external massless fermions 
and bremsstrahlung in the SM (without QCD). All particles are considered 
as outgoing. 



external fields (E) 


# diagrams 


groves 


uuuuuu 


144 


U 


1-8 


uuuuuuq 


1008 


U 


I- 24 + 36- 16 


uuuudd 


92 


4 


•11 + 6-8 


uuuudd'j 


716 


4 


•95 + 6 -24+ 12 -16 


e+i-uudd 


35 


1 


■11 + 3-8 


£ + £~uudd^ 


262 


1 


•94 + 3-24 + 6-16 


t+vdudd 


20 


2 


• 10 


£ + ududd^ 


152 


2 


• 76 


£ + £-£ + vdu 


20 


2 


• 10 


£ + £-£ + uduj 


150 


2 


• 75 


£ + v£~vdd 


19 


1 


•9 + 2-4 + 1-2 


£ + u£-jjdd-f 


107 


1 


■59 + 2- 12 + 2- 8 + 2-4 


£ + v£~v£ + £- 


56 


4 


■9 + 4-4 + 2-2 


£ + v£-v£ + £--i 


328 


4 


•58 + 4-12 + 4-8 + 4-4 


£ + v£~vvv 


36 


4 


■6 + 6-2 


£ + v£~vvi"-y 


132 


4 


•26 + 2-6 + 4-4 


vvvvvv 


36 


tt 


\-2 
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Figure 3.7: The two groves in the CC20 process e + e~ — ► e~v e ud. Gauge 
flips are drawn a solid lines and flavor flips are drawn as dotted lines. These 
groves can be found in table \3.\ as l + 1~ £ + vdu. 



as uu — > uuuu. We can read off table |3.1| that, in the case of identical pairs, 
there are 18 groves, of 8 diagrams each. If all three pairs are different, the 
number of groves has to be divided by 3!, because we are no longer free 
to connect the three particle-antiparticle pairs arbitrarily. Thus there are 
24 diagrams contributing to the process e + e~ — > fi + [i~T + T~ and they are 
organized in three groves of 8 diagrams each. Any diagram in a grove can 
be reached from the other 7 by exchanging the vertices of the gauge bosons 
on one fermion line and be exchanging Z° and 7. Since there are no non- 
abelian couplings in this process, the separate gauge invariance of each grove 
could also be proven as in QED, by varying the hypercharge of each particle: 
A oc Ai ■ qlq^qr + A 2 ■ g e ^<2V + A 3 ' ^q^l- 

As is well known, the diagrams for the CC20 process e + e~ — > e~v e ud fall 
into two separately gauge invariant classes, depicted in figure |3]7| (including 
processes related by crossing symmetry). Figure |3.8| shows two views of the 
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Figure 3.8: Two views of e + e — > v e v e v e v e . Gauge flips are drawn a solid 
lines and flavor flips are drawn as dotted lines. 



groves for e + e~ — ► v e v e v e v e (including processes related by crossing symme- 
try). The left hand side emphasizes the four- fold permutation symmetry of 
flavor flips, while the right hand side emphasizes the groups of six diagrams 
forming a single grove. The process e + e~ — > uui/uj is not yet the most press- 
ing physics problem, but small enough to produce the nice picture shown in 
figure p.9| . The decomposition 4 - 26 + 2- 6-1-4-4 is clearly visible. Finally, fig- 
ure p.10 displays the forest for the process 77 — > uddu in the SM . The grove 
in the center consists of the 31 diagrams with charged current interactions 
(the set CC21 of The four small groves of neutral current interactions 

are only connected with the rest of the forest through the charged current 
grove. The automorphism group of this forest has 128 elements. 

The groves can now be used to select the Feynman diagrams to be calcu- 
lated by other means. However, we note that it is also possible to calculate 
the amplitude with little additional effort already during the construction of 
the groves by keeping track of momenta and couplings in the diagram flips. 
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Figure 3.9: The decomposition 4 - 26 + 2- 6 + 4- 4 of e + e — > vvvv^f. Gauge 
flips are drawn a solid lines and flavor flips are drawn as dotted lines. 



3.2.7 Automorphisms 

The forests and groves that we have studied appear to be very symmetri- 
cal in the neighborhood of any vertex. However, the global connection of 
these neighborhoods is twisted, which makes it all but impossible to draw 
the graphs in a way that makes these apparent symmetries manifest (see 
figure |3J|) . 

Nevertheless, one can turn to mathematics (in particular to computa- 
tional and constructive group theory |83| ) and construct the automorphism 
groups Aut(F(E)) and Aut(C i (£?)) of the forest F{E) and the groves Gi(E), 
i.e. the group of permutations of vertices that leave the edges invariant. 
These groups turn out to be larger than one might expect. For example, 
the group of permutations of the 71 vertices of the forest ^(77 — > uddu) in 
figure [3.101 , that leave the edges invariant, has 128 elements. Similarly, the 
automorphism group of the forest in figure EO] has 120 = 5! elements. The 



study of these groups and their relations might enable us to construct gauge 



50 



Figure 3.10: The forest of size 71 for the process 77 — > uddu in the SM 
(without QCD, CKM mixing and masses in unitarity gauge) with one grove 
of size 31, two of size 12 and two of size 8. Solid lines represent gauge flips 
and dotted lines represent flavor flips. 



invariant subsets directly. 

3.2.8 Advanced Combinatorics 

Even pure gauge forests can provide new insights into the combinatorics of 
gauge cancellations among Feynman diagrams. Consider the process gg — ► 
ggg: there are 25 diagrams (all in the same grove, of course). 15 of them 
have three triple gluon vertices and the remaining 10 diagrams have one triple 
and one quadruple vertex each. The sub-forest of the 15 diagrams with triple 
vertices is equivalent to the one shown in figure The 30 endpoints of the 
gauge flips starting from the 10 diagrams with a quadruple vertex cover this 
sub-forest with 15 nodes exactly twice. 

Therefore, the gauge cancellations do not follow from a simple recursion 
of the gauge cancellations in gg — > gg, where the sub-forest of diagrams with 
triple couplings is covered exactly once. Unfortunately, the symmetries in 
gg — > gggg are much more complicated. Nevertheless, the automorphism 
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groups of groves should be studied in order to understand the gauge cancel- 
lations better. 

3.2.9 Loops 

The number of loops is invariant under flips, as can be seen from Euler's for- 
mula. However, some of the proofs above have made explicit use of properties 
of tree diagrams. Therefore further research is required in this area. 
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Phase Space 



Not from the stars do I my judgment pluck, 
And yet methinks I have astronomy 
[William Shakespeare: Sonnet XIV] 

The calculation of matrix elements using the methods outlined in chapter § 
is but the first step towards a comparison of theoretical predictions with 
experiment. In the second step, the differential cross section obtained from 
squaring the amplitude has to be integrated in the region of phase space 
corresponding to the detector. While this step is common to all observables, 
it remains highly non-trivial and will be discussed in detail in this chapter. 

In our application, the integration of a function / with a given measure \x 
on a manifold M 

Hf) = [ MP) Hp) (4-1) 

is complicated by three facts 

1. the manifold of multi particle phase space and the corresponding Lorentz 
invariant measure 

Mp) = S 4 (E n kn - P) EL d*k n 8{kl - ml) (4.2) 

is non-trivial for three particles in the final state and analytically in- 
tractable for more than three massive particles, 

2. the function is given by a squared matrix element multiplied by kine- 
matical cuts or acceptances 

f(p) = \T(k 1 ,k 2 ,...)\ 2 -C(k 1 ,k 2 ,...) (4.3) 

and usually ill-behaved. Integrable singularities are common in theories 
with massless particles and fluctuations over many orders of magnitude 
are the norm rather than the exception, 
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3. in realistic applications, the acceptances and efficiencies in C(k\, . . . ) 
are almost never given as analytical functions. Instead, they are them- 
selves the result of complicated simulations. 

4 .1 Four Roads from Matrix Elements to Answers 

4-1.1 Analytic Integration 

Analytic integration is mentioned only for completeness. Already for three 
or more particles in the final state with halfway realistic cuts, a fully ana- 
lytical calculation of integrated cross sections is impossible. At least some 
integrations will have to be performed numerically. 

4-1-2 Semianalytic Calculation 

The radiative corrections for LEP1 precision physics on the Z°- resonance 
are dominated by soft photons. The phase space in the soft limit remains 
sufficiently simple so that all but a few integrations can be performed analyt- 
ically for simple cuts. Since programs based on this approach are extremely 
fast compared to the more universally applicable, they play an important 
role in LEP1 phenomenology. Due to their speed and precision, they are 
the preferred tool for the unfolding of electroweak precision observables at 
LEP1. An additional application is to act as benchmarks for other programs. 
The situation at LEP2 and the LC is different, because more phase space for 
hard radiation is available and substantial changes are necessary in these 
programs. 

4-1-3 Monte Carlo Integration 

If only /(/) in ( }4.1|) has to be calculated, it is possible to calculate the es- 
timator E(f), given in (|4.7| ) below, directly, using an appropriate source of 
pseudo random numbers. This is known colloquially as "weighted event gen- 
eration," because the phase space points used for sampling are not equally 
likely, but have a weight attached to them. Since / is not required to be 
bounded for this approach, the application can focus all attention on mini- 
mizing the sampling error ( |4.<j| ). The techniques for achieving this goal are 
well developed and the currently most sophisticated are described in detail 
in section [0[ 
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Figure 4.1: Simulated event e + e~ — > dufiu^ at y/s = 200 GeV ("particle 
identification" from Monte Carlo record). Event generated by WOPPER j&5|/ 
and passed through DELPHI detector simulation 



4-1-4 Monte Carlo Event Generation 
In principle, it is possible to use weighted event generators also in detector 



simulations similar to the one shown in figure |4.1| , as long as the detector 
simulation is equipped with the trivial infrastructure for the bookkeeping of 
event weights. 

However, this approach has the serious drawback that the distribution of 
events that are simulated in the detector, can be different from the realistic 
distributions. In the result of the integration, this is corrected by the weights, 
but the problem remains that the detector is exercised mostly in the region 
of the phase space where few events are expected in the experiment. 

Therefore it is desirable to generated "unweighted events" , i. e. events 
with uniform weight 1, which are distributed exactly as predicted by the- 
ory. For bounded probability distributions, such samples can be generated 
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Figure 4.2: Actual LEP2 event at \/s = 189 GeV: e + e — > qq^iv^ m i/ie 
ALEPH detector. 



by brute force rejection (generate a phase space point x and a uniform de- 
viate y; accept x, if y < p(x)/p ma _ x , else repeat) and if the distribution does 
not vary too much, this method can be sufficiently efficient. Unfortunately, 
most distributions in High Energy Physics (HEP) are singular and require 
sophisticated mapping techniques to handle the integrable singularities (see 
section fO| ). 

4-1-5 Why We Can Not Use the Metropolis Algorithm Naively 

At first sight, the Metropolis algorithm appears to be an attractive alter- 
native to the brute-force rejection algorithm for unweighted event generation. 
The Metropolis algorithm constructs a Markov process covering the phase 
space, which can be shown to converge asymptotically to the desired prob- 
ability density without having to reject a single event. A Markov step is 
accepted with a probability that is equal to the ratio of the probability den- 
sities at the new and the old phase space point. If the move is rejected, the 
same point must be added to the sample again. The advantage of the pro- 
cedure is that probabilities exceeding unity can be treated as if they were 
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unity and probability densities need not be bounded for this agorithm to 
work (see |90|, |9l[ for pedagogical discussions). 

If the probability distribution is rather steep, simple numerical experi- 
ments show that the number of rejected moves is high. Therefore, the num- 
ber of events that will appear more than once in a finite sample will be high 
as well and this causes problems further down the simulation chain. Error 
estimates in detector simulation and unfolding tools might not be prepared 
for finite samples with many duplicate events |92 ] . 

In general, the problem is that, unless the step size and the initial con- 
ditions are chosen (and varied!) carefully, errors can be underestimated 
significantly for steep distributions. Nevertheless, if the problems with ob- 
taining robust error estimates can be gotten under control, the Metropolis 
algorithm could also develop into a powerful part of the calculational tool 
chest for event generators. But until then, a strong caveat emptor regarding 
results obtained in this way is in order. 



4-2 Monte Carlo 

simulate: feign, pretend to have or feel, put 
on; pretend to be, act like, resemble, mimic, 
take or have an altered form suggested by; . . . 

[Concise Oxford Dictionary] 

The overview in the preceding section has shown that weighted and un- 
weighted Monte Carlo methods are the most robust and flexible part of the 
phenomenological tool chest for phase space integration. Therefore they de- 
serve a more detailed discussion. Given a (pseudo-)random sequence of points 
in n-particle phase space 

Pg = {Pi,P2,--- ,Pn}, (4.4) 
where the points are distributed according to 

dfi g (p) = g(p)d/i(p), (4.5a) 

with 

dfi g (p) = 1 and g(p) > (almost everywhere) , (4.5b) 
we get an estimator of 

/(/) = / Mp) f(p) = [ d N {p) ^4 (4-6) 
Jm Jm 9\P) 
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as 



E(f) = 




In ( |4.7[ ), the volume of M is encoded in g, because g is normalized according 
to fl4.5b| ). As written, E(f) depends on g and the particular p g , but the 
dependence on g drops out completely in the ensemble average over all p g . 
Explicitely spelling out the dependence on g is important, because the mea- 
sure dfi(p) (i. e. dfig{p) for g(x) = 1) is in general by no means the one that 
can be generated most efficiently As will be shown shortly, it is also not the 
most useful one. Therefore a discussion should consider a general g from the 
outset. 

For large p g , E(f) will converge to /(/), but when comparing theoretical 
predictions with experimental observations, we have to deal with the more 
complicated problem of estimating the error for a finite sample size and 
minimizing this error. 

Besides optimizing g for this purpose, we could also use a deterministic p g 
derived from quasi random numbers. Under certain conditions, this approach 
can speed up convergence dramatically |93| (from l/y/N to 1/N) at the 
expense of the robustness of the conditions. Therefore we will not proceed 
along this line here and just notice that further research in the area is highly 
desirable. Also it is not clear how to interface deterministic Monte Carlos 
using quasi random numbers with detector simulations. 

An unbiased estimator for the variance of E(f) is given by 



V(f,g) = 




A direct consequence of ( |4.8|) is V(f, f) = 0. This vanishing error is not 
paradoxical, because 1(g) has to be known exactly in order to be able to 
generate p g . For the following, it is useful to note that V(f,g) is a biased 
estimator for W g (f / g) / (N — 1), where the variation W g (h) of a function h 
with respect to the measure fi g is defined as 

W g (h)= [ dn 9 (p) ((h(p)f-(I(h)) 2 ) = [ dfi g (p) (h(p)-I(h)) 2 >0. 

JM JM 

(4.9) 

The advantage of W g (f / g) over V(f,g) is that W g (f / g) does not depend on 
the particular p g , which will simplify the following reasoning considerably. 
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Figure 4.3: In the left graph, Ai(/, g a ) is minimized for a simple step func- 
tion, while in the left graph A 2 {f, ga) is minimized in the same function space, 
^{f, g a ) distributes the mismatch more evenly to achieve faster convergence 
for integration, but has lower efficiency for event generation. 



For Monte Carlo integration, the optimal g will minimize 



in order to maximize the rejection efficiency. If / is a wildly fluctuating func- 
tion, this optimization of g is indispensable for obtaining a useful accuracy. 
Typical causes for large fluctuations are integrable singularities of / or \x 
inside of M or non-integrable singularities very close to M. Therefore, we 
will use the term "singularity" for those parts of M in which there are large 
fluctuations in / or \x. 

If the g could be chosen from a function space that includes /, both A x 
and A 2 would find the same optimal g — /, since Ai(f, /) = and A 2 (/, /) = 
0. However, in practical applications, we have g e G, with a function space G 
that does not include / (otherwise we would already know both /(/) and an 
efficient algorithm to generate phase space points according to ///). There- 
fore, which g G G is closest to / will in general depend on whether the metric 
is A x or A 2 . 



A 2 (f,g) = W g (f/g) 



(4.10) 



while for event generation, the optimal g will minimize 




(4.11) 
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For concreteness, consider the trivial example of f(x) = x 2 on M = [0, 1] 
with a one parameter family of step functions 



g a (x) = a ■ 9(1/2 - x) + (1 - a) ■ G{x - 1/2) (4.12) 

Then 

1 155 1 

A 2 (/,0«) = — — + — — r - - (4.13) 

160 ■ a 160 • 1 — a) 9 



which is minimized by 



a 2 = — = 0.15226.... (4.14) 

1 + V31 



On the other hand, Ai(f,g a ) is minimized by 

a 1 = \ (4.15) 
5 

and there is a noticeable difference, even for smooth functions, as shown in 



figure 4.3. 



From a technical point of view, A2 is more convenient. For a given g e G, 
A^Cf, g) will exist for many interesting functions / for which Ai(/, g) is use- 
less because sup M (f/g) diverges. Furthermore, the estimation of A 2 (f,g) is 
statistically robust, while the estimation of sup M (/ / g) required by Ax(f, g) is 
much more delicate. Finally, the optimality equations for g derived from A 2 (f, g) 
by functional derivation are linear. 

The set G is not defined precisely, it includes all probability densities 
that can be generated "cheaply." This includes all densities for which the 
inverse of the anti-derivative can be computed efficiently, but there are other, 



less well known, cases p4| . Of course, there is no absolute measure for the 
"cheapness" of generating g. Instead, the effort has to be put in context. As 
long as the generation of a phase space point p does not take longer than 
the evaluation of the differential cross section f(p), the generation can be 
considered "cheap." If the variance reduction is substantial, it can be useful 
to use even more expensive g's. 

The manual minimization of either A n (f,g) for a given / can be a te- 
dious exercise, that has to be repeated every time the coupling parameters 
or kinematical cuts in / are changed. For this reason it is economical to use 
adaptive algorithms that minimize A n (f,g) automatically, as long as they 
perform satisfactorily. 
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4-3 Adaptive Monte Carlo 



Manual optimization of g is often too time consuming, in particular if the 
dependence of the integral on external parameters (in the integrand and in 
the boundaries) is to be studied. Adaptive numerical approaches are more 
attractive in these cases. 

In this section, we will use for illustration only integrals over the unit 
hypercube 

xGM = ]0,f (4.16) 

with Cartesian measure 

dfx(x) = dxx A dx 2 A ... A dx n (4.17) 

and Vol(M) = 1. The general case follows by covering the manifold M with 
one or more maps and taking into account the appropriate Jacobians. 



4.3.1 Multi Channel 

The basic question is: how can we implement a variable weight g in dfi g (x) = 
g(x)dfj,(x) efficiently? An obvious option is to employ a weighted sum of 
densities 

n 

d( x ) = ^2 a i9i(%) (4-18) 
i=i 

with 

/n 
dfi(x) gi{x) = 1 and ^^«t = 1 and a« > (4.19) 

8=1 



and to optimize the weights |95|]. This approach has been very successful 
in Monte Carlo integration (e.g. [J7|). However, in applications with 



several hundred different channels gi ]97| , the optimization of the weights a 



according to |)5| yields only modest improvements |98| . It appears to be more 
important that a channel is present, than that the weights are accurately 
tuned. 

In a typical application, the gj are constructed manually from squared 
signal diagrams with simplified spin correlations. The corresponding distri- 
butions can be generated easily by successive integration of the s-channel 
poles and t-channel singularities. The drawback of this approach is that 
the location and the shape of the singularities has to be known analytically. 
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*1 



Figure 4.4: A fixed grid with variable weights can not adapt to singular inte- 
grands. 




x 



Figure 4.5: In one dimension, a variable grid with fixed weights can adapt 
well to singular integrands. 



While the location of the poles is fixed by the particle masses, radiative 
correction and interference terms will modify the shape and shift the ac- 
tual maxima. A particularly serious problem is that the effects of radiative 
corrections depend strongly on kinematical cuts. 

A special case of (|4.18|) is given by using the characteristic functions of 
fixed bins as gi, which corresponds to variable weights for bins in a fixed 
grid, as depicted in figure |4|4]. This case is not useful for applications in 
particle physics, because a grid with a fixed resolution can not adapt to the 
power law singularities from propagators as are ubiquitous in particle physics. 
Differential cross sections typically vary over many orders of magnitude and 
the grid would have to be too fine to be useful. 

In one dimension, there is an interesting alternative to varying the weights 
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Figure 4.6: Quadrangles do not remain convex in general after unconstrained 
shifts of lattice points. 



of fixed width. It is possible to vary instead the size of bins with fixed weight^] 

N Xi- Xi-x 

i=X 

This way it is possible to have bins corresponding to densities that vary 
over many orders of magnitude. It is possible to iteratively improve the 
grid, driven by samplings of f(x)dfi g (x). The improvement can either de- 
crease Ai(/, g) {importance sampling for event generation) or decrease A 2 (/, g) 
(stratified sampling for high precision integration). An optimal g would solve 
the functional equation 

-f A M (/,0) = Ai f2 (/,0) = O. (4.21) 

While it is straightforward to sample A' 12 (f, g) simultaneously with f/g, 
an exact solution of Q4.21|) is not possible, because contributions have to 



be moved from one bin to another. Nevertheless, an approximate solution 



is possible under the assumption that the integrand in (|4.21 ) is piecewise 
constant |99| . 



Unfortunately, there is no immediate generalization to higher dimensions, 
because freely shifting vertices starting from a hypercubic lattice will in gen- 



eral lead to quadrangles that are no longer convex, as illustrated in figure p~6 . 
The concave quadrangles are more than a technical inconvenience, because 
they lead to numeric instabilities of the grid optimization. 



4-3.2 Delaunay Triangulation 

Mathematically, the most natural decomposition of the integration region 
in higher dimension is into simplices. In fact, there have been attempts to 

Varying the weights as well would be possible but adds no new flexibility. It is therefore 
better to avoid the added complexity and to keep all weights identical. 
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Figure 4.7: Delaunay triangulation with Steiner points adapted to a singu- 
larity on a quarter circle. 



use simplical decompositions in adaptive Monte Carlo ||101|| . The results 
of this exercise have not been impressive so far, in particular for strongly 
fluctuating integrands. The reason for the failure of the naive approach is 
that the shape of the simplices is not constrained and that instabilities will 
produce "splinters" that are numerically ill-behaved. 

The construction of "well behaved" simplical decompositions is a rather 
non-trivial mathematical problem (see ||102|| for reviews), that remains un- 



solved in the general case. However, there has been progress in two and 
three dimensions, driven in part by the needs of the engineering community 
for numerically robust finite element codes. The theory for two dimensions is 
rather complete and mathematically very beautiful ||103|| . Therefore it is ap- 
propriate to make a factorized ansatz ||104|| , in which the "least factorizable" 
pairs and triplets are handled by simplical decomposition, using triangles 
and tetrahedra. 

In two dimensions, the most suitable tool are constrained Delaunay tri- 
angulation?, with Steiner points. For a given set of points, the Delaunay 
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triangulation is the unique triangulation in which the center of the circle 
passing through the corners of each triangle is inside of that triangle. This 
uniqueness helps to stabilize the optimization of the triangulation, which is no 
longer unique, if we allow to add additional points, so-called Steiner points, 
to the point set in order to satisfy additional constraints. Nevertheless, there 



are well defined and terminating algorithms ||105|| that construct constrained 
Delaunay triangulations with global and local maximum area constraints, as 
well as minimum angle constraints. The former are what is required for mod- 
eling a function for importance sampling and the latter stabilize the iterative 
refinement of the triangulation. 

However, there is still much more work needed to put these ideas to work 



in particle physics applications beyond a proof of concept | 104 |, because the 
required computational geometry is at or beyond the state-of-the-art in finite 
element analysis for engineering. 

4.3.3 VEGAS 

Factorization 

The previous section has shown that one-dimensional distributions are sin- 
gled out because a mathematically well defined optimization strategy can be 
formulated with very little effort. Therefore it was tempting to propose a 



factorized ansatz 99 



g(x) = gi(x 1 )g 2 (x 2 ) ■ ■ ■ g n (x n ) (4.22) 
and to investigate its properties. It turns out that (4.22) produces satisfac- 



tory results with very little effort and the corresponding code VEGAS [|99] has 
been used as a "black box" by countless physics programs. For two decades 
no significant improvements of VEGAS have been proposed. 

The factorized ansatz Q4.22Q of VEGAS guarantees that the resulting grid 
is hypercubic. Furthermore, the simple one- dimensional update prescriptions 
can be used in each dimension, if the estimators entering the prescriptions 
are averaged over the remaining dimensions. A final benefit of the factorized 
ansatz is that the computational costs grow only linearly with the number 
of dimensions and not exponentially. 

Of course, not all functions of interest in particle physics can be approxi- 
mated by a factorized ansatz. While the functions depicted in figure [4.8| have 
factorized singularities after a simple coordinate transformation, their sum in 
figure |4.9| can not be sampled efficiently by VEGAS. Similar example are ubiq- 



uitous in particle physics, because the cross sections are derived from sums 
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Figure 4.8: Integrands that VEGAS can handle after a coordinate transforma- 
tion. 




Figure 4.9: An integrand that VEGAS can not handle, even after a coordinate 
transformation. 



of Feynman diagrams and have singularities in several kinematical variables 
which can not be used as coordinates simultaneously. 



For example, the three jet production e + e~ — > qqg in figure |4.10| has im- 
portant singularities in both variables si/ 2 = (qi/2+k) 2 - In this case, there are 
parametrizations of the phase space that use both s± and S2 simultaneously. 
However the corresponding Jacobian \d<f)/dx\ contains non-factorizable singu- 
larities in si and S2 induced by a Gram determinant as 1/ ^ A 4 (pi,p 2 , q~[, (72) • 
Thus there is no parametrizations in which both singularities factorize si- 



multaneously and VEGAS can not adjust to both singularities (see [ 1U6| ]). A 



general solution to this problem will be presented below in section 4.3.4 



Stratified vs. Importance Sampling 



The implementation of VEGAS |99|] that is widely used, has three different 



modes: importance sampling, stratified sampling and a hybrid of the two. 
Importance sampling can be selected by the application program, while the 
choice between stratified sampling and the hybrid is done by VEGAS based 
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Figure 4.10: Three jet production e + e — > ggg. 
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Figure 4.11: VEGAS grid structure for importance sampling 



on the dimension of the integration domain. The implementation pSfl very 
terse and hard to understand. Even popular textbooks ||107|| reproduce the 
code without further explanation. The reimplementation that was required 



for ||110|| , aims to be more readable. 

In pure importance sampling, VEGAS adjusts a grid like the two-dimensional 
illustration in figure |4.11| . The sampling proceeds by first selecting a hyper- 
cube at random and subsequently selecting a point in this hypercube accord- 
ing to a uniform distribution. This algorithm generates points distributed 
according to fl£2Cp and flOg) . 

In stratified sampling, faster convergence is attempted by distributing 
sampling points more uniformly than random samples. In VEGAS, stratified 
sampling is implemented by dividing the bins of the adaptive grid further as 
shown in figure |4.12| . The number of divisions is chosen such that the sum 
of two samples per bin approximates the desired total number of samples. 



67 




Figure 4.12: VEGAS grid structure for genuinely stratified sampling. 



Now the sampling proceeds by selecting two random points from each bin of 
the finer grid. This procedure shows better convergence for integration than 
importance sampling. 



As can be seen from table |4.1| , the algorithm for stratified sampling as 
described above can only be used in low dimensions. Already the eight- 
dimensional four-particle phase space would require many billions of sampling 
points, which is usually not possible. 

Nevertheless, the stratified sampling algorithm performs empirically bet- 
ter than the importance sampling algorithm and it is worthwhile to pursue 



the hybrid approach illustrated in figure 4.13. Here the stratification grid is 



no longer a subdivision of the adaptive grid. Instead, the points selected in 



Table 4.1: To stratify or not to stratify: number of calls required for a grid 
of 25 bins in each direction, as a function of the dimension of the integration 
domain. 
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Figure 4.13: VEGAS grid structure for pseudo stratified sampling. 



the stratification grid are mapped linearly onto the adaptive grid and further 
nonlinearly onto the integration domain. The adaptive grid is updated to 
minimize Ai(/, g). This hybrid is, strictly speaking, theoretically not justi- 
fied because importance sampling assumes a random distribution. However, 
the "more uniform" distribution appears to perform better in practical ap- 
plications and the hybrid approach is very successful. 



4.3.4 VAMP 

Introduction 

As mentioned before for the examples in figure |4.<j| , the property of factoriza- 
tion depends on the coordinate system. Consider, for example, the functions 



fi(x 1 ,x 2 ) 

f2(xi,X 2 ) 



(xi - ai) 2 + b\ 
1 



(a/x 2 + x\ - a 2 



(4.23a) 
(4.23b) 



on M = (—1,1) (g) (—1,1) with the measure d/x = dxi A dx 2 . Obviously, 
fi is factorizable in Cartesian coordinates, while f 2 is factorizable in polar 
coordinates. VEGAS will sample either function efficiently for arbitrary 6 12 
in suitable coordinate systems, but there is no coordinate system in which 
VEGAS can sample the sum fi + f 2 efficiently for small b± j2 . 

There is however a generalization [|111|| of the VEGAS algorithm from fac- 
torizable distributions to sums of factorizable distributions, where each term 
may be factorizable in a different coordinate system. This larger class in- 
cludes most of the integrands appearing in particle physics and empirical 
studies have shown a dramatic increase of accuracy for typical integrals. 
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Maps 

The problem of estimating /(/) can be divided naturally into two parts: 
parametrization of M and sampling of the function /. While the estimate 
will not depend on the parametrization, the error will. 

In general, we need an atlas with more that one chart to cover a mani- 
fold M. We will ignore this technical complication in the following, because, 
for the purpose of integration, we can always decompose M such that each 
piece is covered by a single chart. Moreover, a single chart suffices in most 
cases of practical interest, since we are at liberty to remove sets of measure 
zero from M. For example, after removing a single point, the unit sphere 
can be covered by a single chart. 

Nevertheless, even if we are not concerned with the global properties 
of M that require the use of more than one chart, the language of differential 
geometry will allow us to use our geometrical intuition. Instead of pasting 
together locally flat pieces, we will paste together factorizable pieces, which 
can be overlapping, because integration is an additive operation. 

For actual computations, it is convenient to use the same domain for the 
charts of all manifolds. The obvious choice for n-dimensional manifolds is 
the open n-dimensional unit hypercube 

U={0,lf n . (4.24) 

Sometimes, it will be instructive to view the chart as a composition = ip o \ 
with an irregularly shaped P G R n as an intermediate step 



U 




(4.25) 



(in all commutative diagrams, solid arrows are reserved for bijections and 
dotted arrows are used for other morphisms). The integral ( [4.1[ ) can now be 
written 

/(/) = f d n x 
Jo 



dx 



/(*(*)) 



(4.26) 
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and it remains to sample \d<p/dx\ ■ (/o0) on U. Below, it will be crucial that 
there is always more than one way to map U onto M 



(4.27) 



and that we are free to select the map most suitable for our purposes. 

The ideal choice for would be a solution of the partial differential equa- 
tion \dcj)/dx\ = l/(/ o 0), but this is equivalent to an analytical evaluation 
of /(/) and is impossible for the cases under consideration. A more realistic 
goal is to find a such that \d(f)/dx\-(fo(f)) has factorizable singularities and is 
therefore sampled well by VEGAS. This is still a non-trivial problem, however. 
For example, consider again the phase space integration for gluon radiation 
QQ9- From the Feynman diagrams in figure |4.10| we had seen that the 



e e 

squared matrix element has singularities in both variables S\ji = (qi/2 + k) 2 
that can not be factorized. On the other hand, it is straightforward to find 
parametrizations that factorize the dependency on si or s 2 separately. 

Returning to the general case, consider N c different maps 0j : U — > M 
and probability densities gi : U — > [0, oo). Then the function 



is a probability density g : M — > [0, oo) 



90, 



-i 



dp 



(4.28) 



d/i(p) g(p) = l 



(4.29) 



M 



as long as the ^ are probability densities themselves and the a, are properly 
normalized 



I 5fj(x)d n x = l, ^ aci = 1 , 
From the definition ( |4.28| ), we have obviously 

a/) = E^ / ^(^ 7l (p)) 

i=l J M 



<au<l. 



(4.30) 



9p 



gip) 



(4.31) 
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and, after pulling back from M to N c copies of U 



'(/) = !> / 

JO 



g(fa(x)) 



(4.32) 



we find a new estimator of the integral /(/) 

7° 



E(f) 



i=l 



9° 



(4.33) 



If the Qi in ( 4.31]) and ( |4.33| ) are factorized, they can be optimized using the 
classic VEGAS algorithm |Mj unchanged. However, since we have to sample 
with a separate adaptive grid for each channel, a new implementation [|110|| 
is required for technical reasons. 

Using the N% maps = (j)' 1 o fa : U — > U introduced in ( (4.27]) , we can 
write the g o fa : U — > [0, oo) in ( |4.33|) as 



dfa 


-i 


dx 





/ 



3 ° Kijj 



\ 



diTi 



dx 



\ 



J 



(4.34) 



From a geometrical perspective, the maps 7iij are just the coordinate transfor- 
mations from the coordinate systems in which the other singularities factorize 
into the coordinate system in which the current singularity factorizes. 

Even if all factorize, the g o fa will not necessarily factorize, provided 
the fa are chosen appropriately. The maps 7Tjj are purely geometric objects, 
independent of the physical model under consideration. They are the coordi- 
nate transformation from a coordinate system Hj in which the iih singularity 
factorizes to a coordinate system Hj in which the jth singularity factorizes. 

The construction of the ir^ requires some mathematical input, but it is 
required only once and the procedure allows economical studies of the depen- 
dence of the cross section on kinematical cuts and on external parameters 
through /, because VEGAS can optimize the gi for each parameter set and set 
of cuts without further human intervention. 

Note that the integral in ( |4.31| ) does not change, when we use fa : U — > 
Mi D M, if we extent / from M to M, by the definition f(M t \M) = 0. 
This is useful, for instance, when we want to cover (—1, 1) (g) (—1, 1) by 
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both Cartesian and polar coordinates. This causes, however, a problem with 
the 7r 12 in ([4.34 ). In the diagram 



U 




M 



U 




(4.35) 



the injections are not onto and since n\2 is not necessarily a bijection 
anymore, the Jacobian \ditij/dx\ may be ill-defined. But since f(Mi\M) = 0, 
we only need the unique bijections <\)' x 2 and 7r[ 2 that make the diagram 

^2 




Xi 



u 




1-2 



— Mo 



p? 



X2 



(4.36) 



U 2 ?7 



commute. 



Multi Channel Sampling 
Up to now, we have not specified the they are only subject to the condi- 



tions ( |4.30| ). Intuitively, we expect the best results when the «j are propor- 
tional to the contribution of their corresponding singularities to the integral. 
The option of tuning the a« manually is not attractive if the optimal val- 
ues depend on varying external parameters. Instead, we use a numerical 
procedure P5j for tuning the a^. 



We want to minimize the variance (4.8) with respect to the «j. This is 
equivalent to minimizing 

W(f, a) = J g(pW(p) 2 (4.37) 

with respect to a with the subsidiary condition J2i a i = 1- After adding a 
Lagrange multiplier, the stationary points of the variation are given by the 
solutions to the equations 

Vi:Wi(f,a) = W(f,a) (4.38) 

where 

a) = ~^-W(f, a) = / W)d n * (^rPi 2 ( 439 ) 
oai Jo \9{<Pi{. x ))J 
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and 



N c 



W(f,a) = J2^W l (f,a). (4.40) 



It can easily be shown [95] that the stationary points ( |4.38[ ) correspond to 
local minima. If we use 

N t = a t N (4.41) 

to distribute N sampling points among the channels, the Wi(f,a) are just 
the contributions from channel i to the total variance. Thus we recover the 
familiar result from stratified sampling, that the overall variance is minimized 
by spreading the variance evenly among channels. 

The Wi(f,a) can be estimated with very little extra effort while sam- 
pling /(/) (see gag )) 



W,«)= ^ ■ (4-42) 




Note that the factor of gi/g in the corresponding formula in |95| is absent 



from ( 4.42 ), because we are already sampling with the weight g>j in each 



channel separately. 

The equations (|4.38|) are non linear and can not be solved directly. How- 



ever, the solutions of ( f4.38|) are a fixed point of the prescription 



^^ = ^fr¥- (/3>0) (443) 

for updating the weights ctj. There is no guarantee that this fixed point will 
be reached from a particular starting value, such as ctj = 1/N C , through suc- 
cessive applications of ( |4.43j ). Nevertheless, it is clear that ( |4.43| ) will concen- 
trate on the channels with large contributions to the variance, as suggested 
by stratified sampling. Furthermore, empirical studies show that ( |4.43| ) is 
successful in practical applications. The value (3 = 1/2 has been proposed 
in [pql , but it can be beneficial in some cases to use smaller values like (3 = 1/4 
to dampen statistical fluctuations. 

Performance 

Both the implementation and the practical use of the multi channel algo- 
rithm are more involved than the application of the original VEGAS algorithm. 
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Therefore it is necessary to investigate whether the additional effort pays off 
in terms of better performance. 

A Fortran implementation of this algorithm, VAMP [LIU], has been used 



for empirical studies. This implementation features other improvements over 
"VEGAS Classic", most notably system independent and portable support 
for parallel processing (see section |4~3|) and support for unweighted event 
generation. 

There are two main sources of additional computational costs: at each 
sampling point the function g o fa has be evaluated, which requires the com- 
putation of the N c — 1 maps tt^ together with their Jacobians and of the iV c — 1 
probability distributions <?j of the other VEGAS grids (see (|4.34! )). 



The retrieval of the current g^s requires a bisection search in each dimen- 
sion, i.e. a total of O((N — 1) ■ ndi m • log 2 (^div)) executions of the inner loop 
of the search. For simple integrands, this can indeed be a few times more 
costly than the evaluation of the integrand itself. 

The computation of the tt^- can be costly as well. However, unlike the gi, 
this computation can usually be tuned manually. This can be worth the effort 
if many estimations of similar integrals are to be performed. Empirically, 
straightforward implementations of the tt^ add costs of the same order as 
the evaluation of the g^. 

Finally, additional iterations are needed for adapting the weights a, of 
the multi channel algorithm described in (|4.3.4 ). Their cost is negligible, 



however, because they are usually performed with far fewer sampling points 
than the final iterations. 

Even in cases in which the evaluation of increases computation costs 
by a whole order of magnitude, any reduction of the error by more than a 
factor of 4 will make the multi channel algorithm economical. In fact, it is 
easy to construct examples in which the error will be reduced by more than 
two orders of magnitude. The function 



Six) 



37T0(r 3 <I) 27r0(r 2 < 1, |x 3 | < I) 



144atan(l/26) \r§((r 3 - V 2 ) 2 + b 2 ) r 2 ((r 2 - 1/2) 2 + b 2 ) 

+ 8(-K^,<l) j | (4 44) 



with r 2 = \J x\ + x\ and r 3 = \J x\ + x\ + xf, is constructed such that it can 
easily be normalized 



i 

3, 



d i xf(x) = l (4.45) 

-l 
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Figure 4.14: Comparison of the sampling error for the integral of f in (U-44 ) 
as a function of the width parameter b for the two algorithms at comparable 
computational costs. 



and allows a check of the result. The three terms factorize in spherical, cylin- 
drical and Cartesian coordinates, respectively, suggesting a three channel 
approach. After five steps of weight optimization consisting four iterations 
of 10 5 samples, we have performed three iterations of 10 6 samples with the 
VAMP multi channel algorithm. Empirically, we found that we can perform 
four iterations of 5- 10 5 samples and three iterations of 5- 10 6 samples with the 
classic VEGAS algorithm during the same time period. Since the functional 
form of / is almost as simple as the coordinate transformation, the fivefold 
increase of computational cost is hardly surprising. 

In figure |4.14| , we compare the error estimates derived by the classic VEGAS 
algorithm and by the three channel VAMP algorithm. As one would expect, 
the multi channel algorithm does not offer any substantial advantages for 
smooth functions (i.e. b > 0.01). Instead, it is penalized by the higher 
computational costs. On the other hand, the accuracy of the classic VEGAS 
algorithm deteriorates like a power with smaller values of b. At the same 
time, the multi channel algorithm can adapt itself to the steeper functions, 
leading to a much slower loss of precision. 

The function / in ( |4.44| ) has been constructed as a showcase for the multi 
channel algorithm, of course. Nevertheless, more complicated realistic ex- 
amples from particle physics appear to gain about an order of magnitude in 
accuracy. Furthermore, the new algorithm allows unweighted event genera- 
tion. This is hardly ever possible with the original VEGAS implementation, 
because the remaining fluctuations typically reduce the average weight to 
very small numbers. 
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A particularly attractive application is provided by automated tools for 
the calculation of scattering cross sections. While these tools can currently 
calculate differential cross sections without manual intervention, the phase 
space integrations still require hand tuning of mappings for importance sam- 
pling for each parameter set. The present algorithm can overcome this prob- 
lem, since it requires to solve the geometrical problem of calculating the 
maps 7Ty in ( 4.34Q for all possible invariants only once. The selection and 
optimization of the channels can then be performed algorithmically. 



A Cheaper Alternative 

There is an alternative approach that avoids the evaluation of the Qi s, sac- 
rificing flexibility. Fixing the at unity, we have for g : M — > [0, oo) 



E 

i=l 



(X; 



9<t>7 1 



and the integral becomes 



'(/) = !> / 

i=i ^ 
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dp 



g(p) 



dp 



E 



(4.46) 



i=l 



\„ x ./(AW) 







VEGAS can now be used to perform adaptive integrations of 



(4.47) 



h{f) 



g{.<t>i{ x )) 



(4.48) 



individually. In some cases it is possible to construct a set of fa such that h(f) 
can estimated efficiently. The optimization of the weights ctj can again be 
effected by the multi channel algorithm described in ( |4.3.4j) . 

The disadvantage of this approach is that the optimal fa will depend sen- 
sitively on external parameters and the integration limits. In the approach 
based on the g in (4.28), VEGAS can take care of the integration limits auto- 
matically. 



4-4 Parallelization 

The computing needs of the LC can only be satisfied economically if parallel 
computing on commodity hardware can be realized. Traditionally, paral- 
lelization is utilized in HEP by running independent Monte Carlo or analysis 
jobs in parallel and combining their results statistically. Nevertheless, more 
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fine grained parallelism is desirable in Monte Carlo jobs that use adaptive 
algorithms^. 

The problem of the parallelization of adaptive Monte Carlo integration al- 
gorithms, in particular VEGAS, has gained some attention recently ||112| , |113 



These implementations start from the classic implementation of VEGAS and 
add synchronization barriers, either mutexes for threads accessing shared 
memory or explicit message passing. These approaches result in compact 
code and achieve high performance, but their low level nature obscures the 
mathematical structure. Since parallel computing is notoriously difficult, it 
is desirable to have a well defined mathematical model that frees physicists 
from having to waste too much effort dealing with technical computing de- 
tails. Therefore, we suggest a mathematical model of parallelism for adaptive 
Monte Carlo integration that is independent both of a concrete paradigm for 
parallelism and of the programming language used for an implementation. 
We decompose the algorithm and prove that certain parts can be executed 
in any order without changing the result. As a corollary, we know that they 
can be executed in parallel. 

The algorithms presented below have been implemented successfully in 
the library VAMP ||110|| , along with the multi channel algorithm described in 
section [4.3-4 

4-4-1 Vegas 

As described in section |4.3.3| , VEGAS uses two different grids: an adaptive 
grid G A , which is used to adapt the distribution of the sampling points and a 
stratification grid G for stratified sampling. The latter is static and depends 
only on the number of dimensions and on the number of sampling points. 
Both grids factorize into divisions d\ s 

G A = df®df®---®df (4.49a) 
G s = df <g> dl ® • • • <g> d s n . (4.49b) 

The divisions come in three kinds 

df = (importance sampling) (4.50a) 

df = df /m (stratified sampling) (4.50b) 

df 7^ df /m (pseudo-stratified sampling) . (4.50c) 



2 Already at LEP2, there are problems with the parallelization of unweighted event 
generation if the maximum weights have to be found in each parallel job separately p%- 
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In the classic implementation of VEGAS ||99|| , all divisions are of the same 
type. In a more general implementation [|110|| , this is not required and it can 
be useful to use stratification only in a few dimensions. 

Two-dimensional grids for the cases ( |4.50a| ) and ( |4.50b|) have been illus- 
trated in figures 4.11 and 4.12. In case fl£50ap , there is no stratification grid 
and the points are picked at random in the whole region according to Ga- In 
case ( |4.50b|) , the adaptive grid Ga is a regular subgrid of the stratification 
grid Gs and an equal number of points are picked at random in each cell 
of Gs- Since df = df /m, the points will be distributed according to Ga as 
well. A one-dimensional illustration of the most complicated case ( |4.5Uc[ ) has 



been shown in figure 4.13 



4-4-2 Formalization of Adaptive Sampling 

In order to discuss the problems with parallelizing adaptive integration al- 
gorithms and to present a solution, it helps to introduce some mathematical 
notation. A sampling S is a map from the space ir of point sets and the 
space F of functions to the real (or complex) numbers 



S : 7T x F 
(PJ) 



R 

/ = 



S(p, f) 



(4.51) 



For our purposes, we have to be more specific about the nature of the point 
set p G 7r. In general, this point set will be characterized by a sequence of 
pseudo random numbers p G R and by one or more grids G G T used for 
importance or stratified sampling. A simple sampling 

5 , :i?xrxAxFxRxR^i?xrxAxFxRxR 

(p, G, a, f, /ii, /i 2 ) i-> (p\ G, a, f, p[, y! 2 ) = S {p, G, a, f, m, p 2 ) 

(4.52) 

estimates the n-th moments p' n G R of the function / G F. The integral and 
its standard deviation can be derived easily from the moments 



a 



Hi 



N 



— (^2 - nl) 



(4.53a) 
(4.53b) 



while the latter are more convenient for the following discussion. In addition, 
So collects auxiliary information to be used in the grid refinement, denoted 
by a G A. The unchanged arguments G and / have been added to the 
result of Sq in (|4.52|), so that So has identical domain and codomain and can 
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therefore be iterated. Previous estimates p n may be used in the estimation 
of fi' n , but a particular So is free to ignore them as well. Using a little 
notational freedom, we augment R and A with a special value _L, which will 
always be discarded by Sq. 

In an adaptive integration algorithm, there is also a refinement opera- 
tion r : T x A ^ T that improves the grid based on the auxiliary informa- 
tion a G A. r can be extended naturally to the codomain of So 

r: RxTxAxFxRxR^RxTxAxFxRxR 

(p, G, a, f, p i? p 2 ) i-> (p, G', a, /, pi, p 2 ) = r(p, G, a, /, p 1? p 2 ) , 

(4.54) 

so that S = rS is well defined and we can specify n-step adaptive sampling 

as 



S n = S (rS ) n (4.55) 

Since, in a typical application, only the estimate of the integral and the 
standard deviation are used, a projection can be applied to the result of S n : 

P: RxTxAxFxRxR^RxR 
(p,G,a,/,pi,p 2 ) i-> \J,(t) ■ 

Then 



(/, a) = PSo(rS ) n (p, Go, ±, /, ±, J.) (4.57) 

and a good refinement prescription r, such as VEGAS, will minimize the a. 

For parallelization, it is crucial to find a division of S n or any part of it into 
independent pieces that can be evaluated in parallel. In order to be effective, 
r has to be applied to all of a and therefore a synchronization of G before 
and after r is appropriate. Furthermore, r usually uses only a tiny fraction 
of the overall CPU time and it makes little sense to invest a lot of effort into 
parallelizing it. On the other hand, So can be parallelized naturally, because 
all operations are linear, including he computation of a. We only have to 
make sure that the cost of communicating the results of S and r back and 
forth during the computation of S n do not offset any performance gain from 
parallel processing. 

When we construct a decomposition of So and proof that it does not 
change the results, i.e. 

So = tS (f) (4.58) 
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or 

ef =1 Gi ®^ ef =1 Gi 

(4.59) 

G — ^ G 

where is a forking operation and l is a joining operation, we are faced with 
the technical problem of a parallel random number source p. 

As made explicit in ( |4.52| ), So changes the state of the random number 
generator p. Demanding identical results, imposes therefore a strict ordering 
on the operations and defeats parallelization. In principle, it is possible to 
devise implementations of Sq and p that circumvent this problem by dis- 
tributing subsequences of p in such a way among processes that the results 
do not depend on the number of parallel processes. 

However, a reordering of the random number sequence will only change 
the result by an amount on the order of the statistical sampling error, as long 
as the scale of the allowed reorderings is bounded and much smaller than the 
period of the random number generator^. Below, we will therefore use the 
notation x ~ y for "equal for an appropriate finite reordering of the p used in 
calculating x and y" . For our purposes, the relation x ~ y is strong enough 
and allows simple and efficient implementations. 

4-4-3 Multilinear Structure of the Sampling Algorithm 
Since So is essentially a summation, it is natural to expect a linear structure 

(J) S (pi, G u di, f, p 1A , u 2 ,i) ~ S (p, G, a, f, /ii, /i 2 ) (4.60a) 

where 

i 

G = ©G, 

i 

a = 

i 

3 Arbitrary reorderings on the scale of the period of the pseudo random number gener- 
ator have to be forbidden, of course. 



(4.60b) 
(4.60c) 
(4.60d) 
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Ain = (|)/V* (4.60e) 



for appropriate definitions of "©" . For the moments, we have standard ad- 
dition 

Hn,l © l*n,2 = Hn,l + Hn,2 (4.61) 

and since we only demand equality up to reordering, we only need that the pi 
are statistically independent. This leaves us with G and a and we have to 
discuss importance sampling and stratified sampling separately. 

Importance Sampling 

In the cases of naive Monte Carlo without grid optimization and of impor- 
tance sampling, the natural decomposition of G is to take j copies of the 
same grid G/j that are identical to G, each with one j-th of the total sam- 
pling points. As long as the a are linear themselves, we can add them up 
just like the moments 

a% © a 2 = a\ + a 2 (4.62) 
and we have found a decomposition (4.60). In the case of VEGAS, the Oj are 



sums of function values at the sampling points. Thus they are obviously 
linear and this approach is applicable to VEGAS in the importance sampling 
mode. 

Stratified Sampling 

The situation is more complicated in the case of stratified sampling. The first 
complication is that in pure stratified sampling there are only two sampling 
points per cell. Splitting the grid in two pieces as above can provide only a 
very limited amount of parallelization. The second complication is that the a 
are no longer linear, since they correspond to a sampling of the variance per 
cell and no longer of the function values themselves. 

However, as long as the samplings contribute to disjoint bins only, we 
can still "add" the variances by combining bins. The solution is therefore to 
divide the grid into disjoint bins along the divisions of the stratification grid 
and to assign a set of bins to each processor. 

Finer decompositions will incur higher communications costs and other 



resource utilization. An implementation based on PVM is described in 113 



which minimizes the overhead by running identical copies of the grid G on 
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each processor. Since most of the time is usually spent in function evalua- 
tions, it makes sense to run a full Sq on each processor, skipping function 
evaluations everywhere but in the region assigned to the processor. This 
is a neat trick, which is unfortunately tied to the computational model of 
message passing systems such as PVM and MPI. More general paradigms 
can not be supported since the separation of the state for the processors is 
not explicit (it is implicit in the separated address space of the PVM or MPI 
processes). 

However, it is possible to implement (|4.60| ) directly in an efficient manner. 
This is based on the observation that the grid G used by VEGAS is factorized 
into divisions for each dimension 

"dim 

G = 6dD> (4.63) 



and that decompositions of the induce decompositions of G 

"dim 

G l ®G 2 = I <X)D ] ®D\ ® (X) D 3 I © I 00D j ®Di® (S?) D 




2 _ 

3=1 i=J+l 
i-1 n dim 

= (g) D j ® (D\ © Dl) ® (g) D j . (4.64) 

j=l j=i+l 

We can translate (|4.64|) directly to code that performs the decomposition D l = 
D\ © D l 2 discussed below and simply duplicates the other divisions . A 
decomposition along multiple dimensions is implemented by a recursive ap- 
plication of ft4.64|) . 

In VEGAS, the auxiliary information a inherits a factorization similar to 
the factorization ( |4.63| ) of the grid 

a = (d 1 ,... ,cT dim ) (4.65) 

but not a multilinear structure. Instead, as long as the decomposition respects 
the stratification grid, we find the in place of ( 4.64j ) 



a, © a 2 = (d\ + 4, . . . , d\ © 4, . . . , d" dim + 4 dim ) (4.66) 

with "+" denoting the standard addition of the bin contents and "©" de- 
noting the aggregation of disjoint bins. If the decomposition of the division 
would break up cells of the stratification grid, the equation ( |4.66| ) would be 



incorrect, because, as discussed above, the variance is not linear. 
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Figure 4.15: Forking one dimension d of a grid into three parts ds (1), ds (2), 
and ds(3). The picture illustrates the most complex case of pseudo stratified 



sampling (see figure IAS). 



Now it remains to find a decomposition 

D i = D\ © D\ (4.67) 

for both the pure stratification mode and the pseudo stratification mode of 
VEGAS (see figures |4.11| and |4.12|) . In the pure stratification mode, the strat- 



ification grid is strictly finer than the adaptive grid and we can decompose 
along either of them immediately. Technically, a decomposition along the 
coarser of the two is straightforward. Since a typical adaptive grid already 
has more than 25 bins, a decomposition along the stratification grid offers 
no advantages for medium scale parallelization (O(10) to 0(100) processors) 
and the decomposition along the adaptive grid has been implemented. This 
scheme is particularly convenient, because the sampling algorithm So can be 
applied unchanged to the individual grids resulting from the decomposition. 



For pseudo stratified sampling (see figure |4.13| ), the situation is more 
complicated, because the adaptive and the stratification grid do not share bin 
boundaries. Since VEGAS does not use the variance in this mode, it would be 
theoretically possible to decompose along the adaptive grid and to mimic the 
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fork along 3 




fork along 1 




Figure 4.16: Recursively fork a 6 x 5 x 4- grid for distribution among 12 
3x2x2 processors. 



incomplete bins of the stratification grid in the sampling algorithm. However, 
this would cause technical complications, destroying the universality of Sq. 
Instead, the adaptive grid is subdivided in a first step in 



lcm 



/ lcm(n/, 



n„ 



(4.68) 



binsfjj such that the adaptive grid is strictly finer than the stratification grid. 
This procedure is shown in figure |4.15| . The code for forking and duplicating 
a single dimension suffices as building block for general recursion ||110|1 , as 



illustrated in figure 4.16 



4-4-4 Random Numbers 
The implementation [ |110| | of this procedure takes advantage of the ability 



of Knuth's new preferred generator [|114|| to generate provable statistically 



independent subsequences. However, since the state of the random number 
generator is explicit in all procedure calls, other means of obtaining subse- 
quences can be implemented in a trivial wrapper. 

4 The coarsest grid covering the division of n g bins into n/ forks has n g j gcd(n/, n g ) — 
\cm(nf,n g )/rif bins per fork. 
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The results of the parallel example will depend on the number of pro- 
cessors, because this effects the subsequences being used. Of course, the 
variation will be compatible with the statistical error. It must be stressed 
that the results are deterministic for a given number of processors and a 
given set of random number generator seeds. Since parallel computing en- 
vironments allow to fix the number of processors, debugging of exceptional 
conditions remains therefore possible. 



4-5 Dedicated Phase Space Algorithms 

No description of Monte Carlo algorithms for phase space integration would 



be complete without mentioning the beautiful RAMBO algorithm ||115|| for 
massless n-particle phase space. RAMBO proceeds by generating massless 
momentum vectors isotropically, without regard to energy or momentum 
conservation. In a second step, these vectors are boosted with a Lorentz 
transformation to their common rest frame to enforce momentum conserva- 
tion and the lengths of massless vectors can be rescaled by a common factor 
to ensure energy conservation without spoiling momentum conservation. It 



can be shown ||115|| that this transformation has a unit Jacobian. Therefore 
the phase space distribution can be generated without a rejection step. The 
only obvious additional cost is that a n-particle final state uses 4n random 
numbers for 3n — 4 degrees of freedom. 

Unfortunately, the transformation from the An random numbers to the 
3n— 4 independent variables has a substantial hidden cost: the relation of the 
integration variables to the kinematical variables in which matrix elements 
have singularities is lost and adaptive Monte Carlo algorithms are almost 
useless. For this reason, RAMBO is mainly useful for smooth background 
distributions. There is a modification (called MAMBO) for light massive 
particles JTTE , but the Jacobians are now no longer unity and the resulting 



event weights make the algorithm not very efficient. 
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— 5— 
Linear Colliders 



Wie Bohmen sie bestand, das einen 
schdnen Tags zum Meer begnadigt wurde 
und jetzt am Wasser liegt. 
[INGEBORG BACHMANN: Bohmen liegt am Meer] 



The marquee measurement in the area of gauge boson physics at a linear col- 
lider is without a doubt the precision determination of their self-couplings, 
known as Triple Gauge Couplings (TGC). As has been stressed in chapter ||, 
the measurements a LEP2 || (see figure |1.4j ), have been a very important 
confirmation, but can not have been surprising, because a consistent de- 
scription of the low energy data leaves little room for deviations from the 
SM predictions at a level that would be detectable at LEP2 [EJ . 

As will be shown in section |5.1| , this is qualitatively different at a LC, 
because higher energy and luminosity allow to probe the couplings at a level 
where deviations due to different realizations of EWSB are possible. 

5.1 Triple Gauge Couplings 

The most popular parametrization of the most general Lorentz invariant 
W + W~ Z°- and W + W~ r y- vertices has been proposed by Hagiwara et al. [|1 1 7 



Lwwv/ 9wwv 



- gXW^Wu (&>V + d v V») + gl^ xp (Wld x W u - cW^) V p 



lvl w 
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where V is either Z° or 7, while gww-y = — e and gww/z = — e cot are 
the SM couplings. All field-strengths tensors in ( |5.1|) are abelian V^ v = 
dfjVn — d v V^ with the tilde denoting the dual tensor. At tree level in the SM, 
the parameters in ( |5.1| ) take the values 

(5.2a) 
(5.2b) 
(5.2c) 
0. (5. 2d) 

Electromagnetic U(1)q gauge invariance dictates gj 5 = at zero momentum 
transfer. The parametrization ( |5.1| ) is calculationally convenient, because 
the parameters translate directly to coefficients of the same size in the Feyn- 
man rules. Therefore, the matrix elements used in most Monte Carlo event 
generators are expressed in terms of (|5.1|). 

Another advantage of the parameters in (|5.1| ) is that they translate im- 
mediately, without unnaturally small or large coefficients, to model indepen- 
dent physical observables for the electromagnetic properties of charged vector 
bosons, such as the magnetic dipole 

Hw 

electric quadrupole 

Qw 

electric dipole 

d w 

and magnetic quadrupole 

Qw 

moments of W ± bosons ||118|| . 

On the other hand, ( |5.1|) does not offer any indication to the probable 
size of the individual contributions. Completely model independent fits of all 
14 parameters simultaneously are not possible, even at a high luminosity LC. 
Furthermore, the parameters in ( p.l[ ) are not necessarily constant and could 
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Figure 5.1: Sensitivity on the parameters of a non-linearly realized 
EWSB sector with J C = 200 fb" 1 and ^ = 800 GeV RUTtf . The vertical 



bars indicate the constraint on x\$ derived from LEP1 analyses \12QJ . The 
blind direction (2 sin 2 8 W , —2 cos 2 8 W , cos 2 9 W — sin 2 9 W ) does not contribute to 
the triple gauge couplings. 



be polynomials in the invariant masses or even general form factors. Obvi- 
ously, , kvi and Ay are CP- violating couplings and their study could be 
postponed. 



A more systematic approach starts from a basis ||120|| for the opera- 
tors that can appear in an SU(2) L ® U(1)y-/U(1)q effective Lagrangian and 
classifies them according to their order in the momentum expansion. For 
non-linear realizations, NDA provides the necessary hierarchy. A systematic 
study can then start from the lowest dimensional operators and can incor- 
porate symmetries like the custodial SU(2) C , which is strongly suggested by 
the low energy electroweak precision data. 

The operators of dimension four in the momentum expansion contributing 
to the TGC are IT23 
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Figure 5.2: TTie sensitivity ellipses of figure \5.1\ rescaled to the new high 
luminosity TESLA with f C — lab -1 . 



L = -i 



16tt 2 



ti(gW^DJJ ] D v U 



tr (g'B^D^D u u) + tr (u^g'B^UgW^) . (5.4) 



The contributions of the interaction ( |5.4p to the TGC can be expressed in 
terms of the general parametrization (|5.1|) by comparing coefficients[] of the 
trilinear terms 



+ 



2 sin 2 0,„ cos 2 6L 167r 2 cos 2 6L(cos 2 6L — sin 2 167r 2 
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2e 2 



2 sin 2 9 W 16n 2 2 cos 2 6L 167r 2 cos 2 6L sin 2 6L 167r 2 



Xg r 



2 sin 2 0„, 16tt 2 2 sin 2 0,„ 16vr 2 sin 2 6 W 16tt 2 ' 



(5.5a) 
(5.5b) 
(5.5c) 



lr The erroneous non-singular transformation formula in [123 is corrected for the special 
case Xgi — x 9r in [124 and for the general case in |125}|. 
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Unfortunately, the transformation ( |5.5| ) is singular and the blind direction 







x 9r 


1 «( 


\xioJ 


blind 




(5.6) 



in the parameter space can not be observed in measurements of the TGC 
alone. However, LEP1 analyses (e.g. [|122|| ) provide independent constraints 
on x 10, which can lift the degeneracy. 

The coefficients in ( |5.4|) have been normalized such that their natural size 
according to NDA (see sections ^T7\ and |A.3.5| ) is of order one. Consequently, 



the all important question from a physics perspective is: Are experiments at 
a LC sensitive to values for x 9 i, x 9r , x w of order one? 

Detector effects appear to be small and preliminary studies can therefore 



work on the generator level with simple geometrical acceptance cuts ||126 
Much more important than detector effects are the distortions of the dis- 
tributions from hard, almost collinear initial state photon radiation, when 
the photon is lost in the beam-pipe. Since these effects cause a migration of 
events in the phase space, simple acceptance corrections do not suffice and 
more complicated unfolding procedures are required. 

In one study [ |121|| (see also 0) the unfolding has been performed by 
generating a large sample of Monte Carlo events (ten times the size of the 



expected data sample at TESLA) with WOPPER |85j and summing over all ra- 
diative and non-radiative events with the same experimental signature. The 
events were subsequently reweighted with anomalous TGC using fast matrix 
elements of WPHACT g2f] to obtain the likelihood function for the TGC. Us- 
ing a quadratic approximation of the likelihood function for the "old" high 
luminosity option at TESLA with / £ = 200 fb" 1 and = 800 GeV, the 
sensitivities in figure |5.1| have been derived. A more careful analysis will have 
to take non-linearities in the likelihood function into account^, but they can 
not change the conclusion that the LC is definitely sensitive to values for xgi, 
x 9r , x w of order one, if constraints on x 10 are taken into account. 

Increasing the integrated luminosity according to the "new" TESLA high 
luminosity option by a factor of five to J C = 1 ab _1 , as shown in figure 5~2". 



shows that such a machine would allow precision tests of the TGC, at the level 
of 10% of reasonable deviations from the SM, caused by different realizations 
of the EWSB sector. 



2 The non-linearities had been neglected because the unfolding would have exceeded the 
limited computing resources in a non- linear fit. 
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e+ 



Figure 5.3: "Single-W" contribution to e + e — > e uud. 



5.2 Single W ± Production 

At energies that are high compared to the masses of all participating particles, 
all cross sections with a finite limit for massless particles have to fall off as 
dictated by dimensional analysis 

a(s) oc - . (5.7a) 
s 

Cross sections that have contributions from the exchange of light particles 
in the t-channel do not necessarily have a finite limit and can diverge in the 
forward direction if the mass of the exchanged particle is send to zero. Such 
cross sections can scale like 

a(s)oc^lnQ^) , (5.7b) 

where fi 2 denotes a typical kinematical scale set by a minimum momentum 
transfer. 

For the t-channel neutrino exchange in pair production, the forward 
singularity is cut off kinematically by the W -mass and the cross section 
is still dominated by ( |5.7a| ) in the TeV-region. The situation is different 



for "single-jy production as shown in figure |5.3| , because the electron or 
positron that goes undetected in the forward direction does not provide a 
kinematical cut-off. The observable cross section is still small at LEP2, but 
it will be larger at a LC than that of W ± pair production. 

A unique physics opportunity of single-iy production in the context 



of TGC is that, unlike W ± pair production, the diagram in figure |5\3 



is 



sensitive to the iy + jy~7-couplings alone, because the Z°-exchange is not 
singular. Therefore it can be used to disentangle W + W~ Z°- and W + W~'y- 
couplings without studying 77-collisions in a dedicated collider. 
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For LEP2, W ± pair production has been studied extensively ||129| and 
many reliable Monte Carlo event generators have been prepared [[130]. A 
more recent dedicated comparison of theoretical predictions for single- W pro- 
duction has revealed that the situation is less favorable for single-PU produc- 
tion. At LC energies, unacceptable variations of the predictions for the total 
cross section on the order of up to five percent have been reported ||131| , |132| . 



It appears that these discrepancies can be traced back to incomplete imple- 
mentations of effects from the finite electron mass. 

The lesson to be learned is that a simple translation of the electron mass 
into a lower cut-off on the electron scattering angle will not suffice at a LC 
and more complete calculations in the forward region are required, if single- 
W production is to be used for precision physics, like the disentangling of 
W + W~Z°- and W+W^-couplings. 

The classification of the gauge invariant classes of Feynman diagrams 
for single-jy production (see figure [3.7[ ) allows to improve the handling of 
radiative corrections. The single-jy and pair production contributions 
to the same four fermion final state can be folded with a separate radiator 
function, without spoiling gauge invariance. Since the two classes have very 
different characteristic scales and the interferences are small, this improves 
the predictions considerably. 



5. 3 Beamstrahlung 



Another issue that differentiates a LC from LEP2 is the non-trivial beam en- 
ergy spectrum caused by beamstrahlung. Unlike initial state radiation, beam- 
strahlung is not characterized by the scale of the hard interaction, but by 
the collective interaction of the crossing bunches, which must be much denser 
than the bunches in storage rings to obtain sufficient luminosity. Therefore, 
beamstrahlung is a more complex physical phenomenon, but it is indepen- 
dent of the hard interaction under study and can be parametrized in a single 
universal beam spectrum for each collider design at each beam energy. 



The comparison of analytical approximations [ 133 1 with the results of 



microscopic simulations [|135|, |136|1 reveals that the former are not adequate. 



A successful pragmatical solution is to parametrize the results of the micro- 



scopic simulations with a simple factorized ansatz 1 137 1 

Dp lP2 (xx,x 2 ,s) = d Pl (x 1 )d P2 (x 2 ) (5.8) 
where x e ± a = E e ± t7 /Ei, eam and 

d e ± (x) = a 5(l -x) + a x x a2 (1 - x) az (5.9a) 
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e + e + /V 
Figure 5.4: Vector boson scattering subprocess in six-fermion production. 

d 1 (x)=a A x a5 (l-x) a6 . (5.9b) 

For a ,i,4 > and 02,3,5,6 > — 1> the ansatz ( |5.9| ) corresponds to a positive 
and integrable distribution can reproduce the soft singularities from multiple 
emission at x e ± —>■ 1 and x 7 — > for 03.5 < 0. 

The ansatz (|5.9|) is able to fit the results of microscopic simulations 



well [137]. Fits for the latest collider parameter sets are available as distri- 



butions and as efficient random number generator in the library circe ||138| 



5.4 Quadruple Gauge Couplings and 
W ± Scattering 

While LEP2 is, for gauge boson physics, essentially a four fermion production 
machine, a LC can also produce final states with six or more fermions, which 
have contributions from quadruple gauge boson couplings. In addition to the 
coupling of an s-channel Z° or 7 to three vector bosons, there are the kine- 
matical configurations corresponding to the fusion of almost on-shell vector 
bosons, as shown in figure [5.4| . This part of the phase space is particularly 
attractive for studying the interactions of strongly interacting vector bosons. 

Realistic estimates for the physics reach in this channl have already been 
obtained using an improved equivalent particle approximation for the "pro- 
duction" of the vector bosons in the intermediate state [ |139|| (see also ||). 



Interesting physics can be probed, but a high luminosity option is required. 

The construction of a complete unweighted event generator for e + e~ — > 
6f, including general gauge boson couplings, is still a challenge. Complete 
calculations for selected channels exist | 140 . |141|| , which are expected to be 



reliable in the regions of the phase space corresponding to top pair production 
and Higgs production. More work is required for a proper treatment of the 
part of the phase space that corresponds to the scattering of almost on-shell 
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W^'s as in figure |5!^ . Currently, work is underway that uses the Monte Carlo 
methods described in section |4.3.4| to bridge this gap. 
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Conclusions 



While it is never safe to say that the future of 
Physical Science has no marvels even more 
astonishing than those of the past, it seems probable 
that most of the grand underlying principles have 
been firmly established and that further advances are 
to be sought chiefly in the rigorous applications 
of these principles to all the phenomena which 
come under our notice. It is here that the science 
of measurement shows its importance — where 
quantitative results are more to be desired than 
qualitative work. An eminent physicist has remarked 
that the future truths of Physical Science are to 
be looked for in the sixth place of decimals. 

[Albert Michelson] 

There is ample model independent evidence from low energy data that weak 
interactions exchange vector and axial-vector quantum numbers, coupling to 
charged and neutral currents. 

The attempt to incorporate these interactions in a renormalizable QFT 
in which perturbation theory is well defined without any short distance cut- 
off has singled out spontaneously broken gauge theories in which the weak 
interactions are described by the exchange of massive gauge bosons. The 
same conclusion is reached by attempting to enforce high energy tree-level 
unitarity of scattering amplitudes. 

As discussed in chapter || (with a little help from appendix |A|), gauge 
theories retain their special status, even if the electroweak SM is interpreted 
as an EFT, which may have to be replaced by a more comprehensive theory 
at a energy scale beyond the physics reach of current colliders. Instead of 
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guaranteeing the consistency of formal[| perturbation theory to arbitrarily 
high energies, the improved high energy behavior is used for organizing the 
expansion of the effective interaction in a more systematic way than would be 
possible with non-gauge vector bosons. Selecting the gauge theory among all 
equivalent EFTs that describe the same physics (on-shell matrix elements) 
does not sacrifice any generality. 

Chapter |3| described how to systematically organize tree-level calculations 
for many particle final states in such a way that non-abelian gauge invariance 
can be maintained by partial sums. This is important in state-of-the-art cal- 
culations, that would exhaust available human and computing resources if 
they had to be done in a single step. In chapter |], methods for improving 
numerical convergence and parallelization of computations have been pre- 
sented, that can be used for obtaining high precision theoretical predictions 
for event rates in channels with many particles in the final state, as discussed 
briefly in chapter [5| 

Currently, a pessimistic scenario can not be excluded, in which a fun- 
damental Higgs particle will be found, together with a few supersymmetry 
partners of SM particles, compatible with the MSSM. While these discoveries 
would be a tremendous achievement of experimental physics and the culmi- 
nation of decades of theoretical work, it would probably put the answers 
to the open questions in the physics of flavor out of our reach. However, 
we must not forget which eminent "future truths of Physical Science" were 
found when Michelson had a look at "the sixth place of decimals. " Rumors 
about the completion of physics are almost always greatly exaggerated . . . 



I. e. without regard to the convergence of the perturbative series. 
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Appendix 
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— A— 

The Renormalization Group 



Wisse, dafi dies Universum das ist, was es zu 
sein vorgibt: ein Unendliches. Versuche nie im 
Vertrauen auf deine logische Verdauungskraft, 
es zu verschlingen; sei vielmehr dankbar, 
wenn du durch geschicktes Einrammen 
dieses oder jenes festen Pfeilers in das 
Chaos verhinderst, dafi es dich verschlinge. 
[Thomas Carlyle: The French Revolution^ 



Until a few year back, textbooks on elementary particle physics and QFT 
have hardly mentioned the method of EFT and have usually given treatments 
of the RG that are have been more formalistic than lucid. Consequently, 
these subjects have had more of an oral tradition than canonical scriptures. 
Fortunately this has changed now with the advent of two excellent textbooks 
offering both a pedagogical |26| and a systematical [^4], |25| exposition of QFT. 

These lectures have been given to an audience of graduate students work- 
ing in experiments and on theoretical projects. Therefore they had to live 
up to the combined challenge of presenting theorists new angles on a subject 
they had already studied, while at the same time offering concrete applica- 
tions to experimentalists, without burying them in formalism. 

The intuitive reasoning in these lectures avoids the horrendous combi- 
natorial complications of perturbative renormalization theory. Nevertheless, 
mathematical proofs of the intuitive observations still need the technical re- 
sults of traditional perturbative renormalization theory. 

These lectures deliberately use a strictly perturbative language in order 
to appeal to the audience's intuition that is trained on Feynman diagrams. 
Many results remain valid outside of PT, but no attempt is made to mention 
this systematically. 



^Translation and quote from Thomas Mann: Betrachtungen eines Unpolitischen. 
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Figure A.l: One-loop contribution to e + e — > qq. 

A.l Introduction 

Ein Teil von jener Kraft, 
Die stets das Bose will und stets das Gute schafft. 

[J. W. v. Goethe: Faust I] 



What is the size of leptons and quarks? According to the most recent report 
of the Particle Data Group (PDG) ]EJ, the compositeness scale is larger 
than 1.5 TeV for all of them, but even the best limits fall short of 5TeV. In 
the SM, we assume all fermions to be strictly pointlike. As you all know, 
the SM is very successful. Yet, we can't pin down the size of its constituents 
below 0.2 TeV -1 . How is this possible? 

The easy way out is to argue that our accelerators are too small to probe 
any deeper. This is too easy, however, because we claim to control the 
SM at the quantum (i.e. one-loop) level. The description of a typical LEP1 
experiment e + e~ —>■ qq involves loop diagrams like the one in figure |A.1| . And 
in this diagram, the loop momentum k is not bounded anywhere and there is 
a part of the integration region where it exceeds the energy scale A ~ 1/ Ax. 

At this point we can rephrase the question: Why is the compositeness 
scale irrelevant to physics at LEP1 energies? The answer will be that the 
compositeness interactions 

AL= 2^W^ (A ' X) 

are indeed irrelevant and the term "irrelevant" will be given a precise tech- 
nical definition below. This might be a bit surprising, because most of us 
have learned in graduate school that interactions like ( |A.1| ) are verboten in 



loop calculations, because they make the theory non-renormalizable. 

A closely related question is: why don't we have to understand M-theory 
to do phenomenological particle physics ? According to the theoretical party 
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Figure A. 2: Electron self energy 



line, we expect a lot of new particles (grand unification gauge and Higgs 
bosons, higher string and membrane modes, etc.) somewhere above 10 16 GeV. 
Why do they have so little impact on our bread and butter phenomenology 
that involves loop diagrams? 

A. 1.1 History 

QFT was introduced shortly after Quantum Mechanics (QM) in the 1920s. 
But while QM was put on a solid mathematical basis in the 1930s, it took 
much longer to make sense out of QFT. 

Soon (in the early 1930s), it was discovered that the second order of PT 
gave non-sensical results in QED. For example, the electron self energy (see 
figure |A.2| , but the calculators in the 1930s had to work without the benefits 
of Feynman diagrams, of course) came out infinite. Even though the lowest 
order predictions were very successful, the inability to calculate corrections 
was unsettling, to put it mildly. By the late 1940s, a workable scheme was 
established for absorbing the infinities in renormalized couplings constants 
and masses. Nevertheless, this process of renormalization was considered a 
nuisance and it was expected that a complete theory would not need it. 

Even though the renormalization procedure apparently worked in practi- 
cal applications, it took another twenty years of hard theoretical work until 
a mathematical proof that the procedure works in all orders of PT was es- 
tablished in the 1960s. 

Today, the formalism of perturbative QFT is well established and the 
electroweak precision measurements at LEP1 leave no doubt that its results 
are phenomenologically sound. Nevertheless, it should be remembered that 
no mathematical proof for the existence of an interacting QFT in four space- 
time dimensions has been given that works outside of the realms of PT. 
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Figure A. 3: Vacuum polarization 

A. 1.2 Renormalizability 

Let us look at a simple example to get a feel for the physics of renormalization. 
For the vacuum polarization diagram of QED (see figure |A.3|) 



iU^p) = -t(p 2 g^ - p^Mp 2 ) (A.2) 

we find a logarithmically divergent integral over k. Obviously, the divergence 
comes from the region —k 2 —>■ oo, where the ee7-vertex has not been tested 
experimentally. A brute force remedy is to cut-off the integral at an arbitrary, 
but very large, scale A 

~ , 2N a f A2 d(-k 2 ) „ a , A 2 . . 

U(p 2 ) = - J + fimte = _ ln __ + fimte (A .4) 

This leaves us with a logarithmic dependence on the unphysical parameter A 
and is not acceptable in this form. We can however impose a renormalization 
condition 

% 2 ) 2 2 =0 (A.5) 

that trades the dependence on the unphysical cut-off A for the dependence 
on the physical renormalization point [i at which we define the charge. Even 
though the result is now finite 

nV) = -f (a.6) 

we are faced with a potentially large logarithm ln(p 2 //i 2 ), that reaches 25 
for LEP2 energies (y/s ~ 190 GeV) with a charge renormalized at low en- 
ergies (/x ~ m e ). The small coupling constant of QED helps to keep this 
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Figure A. 4: Vacuum polarization counterterm 



logarithm relatively harmless, but since LEP1 has introduced the era of elec- 
troweak precision physics, higher orders in PT are required for reliable quan- 
titative predictions. We will see below, that there are cases, where the effec- 
tive coupling is much larger and a resummation of the perturbation series is 
required. 

It is far from obvious that the subtraction procedure, that I have sketched 
so crudely, works to all orders in PT. It is not even obvious that all re- 
quired subtractions correspond to a renormalization of coupling constants 
and masses. While I will not go into detail, it will be helpful later to estab- 
lish some nomenclature at this point. 

The subtraction leading from ( A.4 ) to (|A.6| ) is obviously equivalent to 



adding the new counterterm vertex depicted in figure |A.4| to the theory. 
Thinking in terms of counterterms instead of subtractions has the advantage 
that the generalization to higher orders is more intuitive. 

In order to get a feeling for the required counterterms, let us consider the 
free field actions for scalars 0, spin-1/2 fermions if), and vectors 

Li = J ^ ^ U ( X )F^( X ) , F^x) = ^ - ^1 . (A.Tc) 

We are using units with H — c—1 and therefore the actions must be dimen- 
sionless. If we assign dimension +1 to mass 

dim(m) = 1 , (A.8) 

then length has dimension —1 in these units 



dim (d 4 x) = -4 (A.9a) 
' d_ 

dx. 



dim ( ) = 1 . (A.9b) 



" I* 
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Now we can read off the dimensions of the fields from ( |A.7| ) immediately 

dim(>(a;)) = l (A.lOa) 
3 

dim (^(ar)) = - (A.lOb) 



1 

3 
2 

dim (^(z)) = 1. 



(A.lOc) 



These dimensions are reflected in the high energy behavior of the propagators, 
which is l/p 2d ~ 4 



<Txe ipx (O|T0(z)0(O)|O) 



p 2 — mi + ie 



d 4 xe ip:c <0|T^(x)V(0)|0) =i 



d 4 xe^(0|T^(o;)^(0)|0) = - . 

p z + ie 



p 2 - m 2 b + ie 



(A.lla) 
(A.llb) 
(A.llc) 



The power counting for loop integrals in Feynman diagrams can be related 
directly to the dimensions of the interaction operators. Let me illustrate this 
by a some examplesQ I will give a more comprehensive account from the 
point of view of the RG in later sections. 

Consider a one-loop integral with two 6 insertions. Since the loop inte- 
gral 



d 4 fc 



(2tt) 4 k 2 (p-k) 2 
is logarithmically divergent, we need a 8 counterterm 



(A.12) 




+ 




finite . 



(A.13) 



Similarly, a <p A and a <f) 6 operator need a 6 counterterm 




+ 




finite 



(A.14) 



2 A systematic exposition too all orders in PT is well beyond the scope of these lectures, 
but can be found in any QFT text. 
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and two 4 operators need a 4 counterterm 

+ ^)"\^ = finite. (A. 15) 

This suggests a pattern, which can indeed be verified by a more detailed 
analysis. If we have multiple insertions of operators with dimension greater 
than four, then counterterms of higher and higher dimension are needed. On 
the other hand, operators of dimension less than or equal to four require 
counterterms of dimension less than or equal to four. 

This pattern can already be seen from dimensional arguments. If an 
operator has dimension greater than four, it must come with a coupling 
constant of negative dimension, e.g. 

~0 6 (x), dim(A) = l. (A.16) 

If the loop integral does not depend on the scale in this coupling constant, 
then the product of two such operators will have a coupling constant of an 
even more negative dimension. The product will therefore be of even higher 
dimension, e. g. 

Since all building blocks have a positive dimension (see (|A.9b| ) and ( |A.10| )), 
there are only a finite number of operators of a given dimension. Therefore, 
interactions of dimension less then four can be renormalized by a finite num- 
ber of counterterms^. As long as a finite number of counterterms suffices, 
the theory retains its predictive power. As a result, interactions of dimen- 
sion four are called renormalizable and interactions of dimension less than 
four are called super-renormalizable. There are additional complications with 
the preservation of symmetries, which do not affect the general argument, 
however. 

It was then very reasonable to expect that nature is described by a renor- 
malizable QFT, such as QED, QCD or the SM. These theories have been 
very successful and as a result, all other — non-renormalizable — theories fell 
from grace and became second class citizens. 

However, two very important questioned were apparently never posed, let 
alone answered: 

3 While this is rather intuitive from the one-loop examples, the mathematical proof is 
extremely complicated in the general case. It took forty years of efforts |73| of some of 
the brightest theoretical physicists of their time to disentangle the multi-loop calculations 
when the loops are not simply nested, but are overlapping. 
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1. Why is nature described by a renormalizable QFT? I will show later 
that the RG will provide a satisfactory answer to this question. 

2. Why should our low energy QFT be correct to arbitrarily high energies? 
The contribution of string theory to particle physics that this question 
is no longer taboo. 



A. 1.3 The Renormalization Group 

The great Cinderella story of theoretical particle physics since 1970 has been 
the rise of renormalization from a nuisance (sweeping things under the rug) 
to a powerful tool. The turning points of this epic are 



1971: Wilson's RG [27 for critical phenomena (i.e. phase tran- 



sitions) and strong interactions proved that renormalization 
could help physics insight, instead of hindering it. 

1972: The proof of renormalizability of spontaneously broken gauge 
theories |]68| , |143|| established a viable candidate for a QFT of 
weak interactions (now known as SM). 



1973: Asymptotic freedom ||144|| paved the way for QCD, the QFT 
of strong interactions. 



1979: Weinberg's EFT paper [146] summarized and formalized the 



folklore of systematic expansions in the energy. 

1990s: Even the non-renormalizable theories have been reinstated 
as first class citizens under the name of EFT. 

By the time of Weinberg's paper, the physics and the formalism was well 
understood by the experts, but it remained for some time part of the oral 
tradition that was covered insufficiently by textbooks. 



A. 2 The Renormalization Group 

caminante, no hay camino 
se hace camino al andar. 
[Antonio Machado: Proverbios y cantares, VI] 

We evaluate the predictions of QFT by doing integrals: loop integrals from 
Feynman diagrams in PT and Feynman path integrals beyond PT. The RG 
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has a very intuitive interpretation ||147|| (see also ||148|| ) in the evaluation of 
Feynman's path integral 



V<pe lbW (A.18) 

and I encourage everyone to consult the reference. But I can not assume a 
previous exposure to ( |A.18| ) and a detailed discussion is beyond the scope of 



these lectures. Therefore I will concentrate on Feynman diagrams and PT. 

A. 2.1 Cracking Integrals 

There are (at least) three common ways to do integrals in physics: either 
from the anti-derivative or from the theorem of residues or from a Riemann- 
(or Cauchy- or Lebesgue- or . . . ) construction. The first method is familiar 
to all of you from high school days, but it is frowned upon in certain circles 
because the second method enables physicists to do "harder" integrals and 
the third method allows mathematicians to integrate more (i. e. "wilder" ) 
functions. As one might guess from this introduction, I will now sing the 
praise of the first method. 

The problem of calculating the integral 

I(f;a,b)= [ dxf(x) (A.19) 



is equivalent to solving the first order Ordinary Differential Equation (ODE) 

±F(x) = f(x), F(b) = C, (A.20) 

since the integral is given by /(/; a, b) = F(a) — C . This relation is familiar, 
because it is used in simple cases in the opposite direction to obtain solutions 
to ( |A.20fl from ( |A.19| ). In fact, we will use it frequently below 



Solving the ODE (|A.2Q ) might appear more difficult than doing the in- 



tegral ( A.19 ) and this is true in the general case. However, I will try to 
convince you that, for many applications in particle physics, the ODE can 
give us very reliable approximate results, where the value of the correspond- 
ing integral is hard (or even impossible) to estimate. In addition, the ODE 
can give us more physical insight than brute force estimates of integrals. I 



will also argue that there are cases, where the integral (|A.19 ) does not exist 



while the ODE (|A.20|) can still give us the physically correct answer. 
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Figure A. 5: Nested divergencies 



Figure A. 6: Overlapping divergencies 



A. 2. 2 Wilson's Insight 



We have seen heuristically in section |A.1| that the divergencies and large 



logarithms in PT originate from logarithmic integrals in loop momenta. For 
nested divergencies (see figure pO|) , it is intuitively clear that the proper pro- 



cedure is to proceed from the inside out and to subtract the subdivergencies 
first. Then only the overall divergency from the outermost loop remains. 

More intricate is the treatment of overlapping divergencies (see figure [Al6| ) , 
where each subdivergency has to be subtracted without double counting. The 
proof that the subtraction algorithm is mathematically sound to all orders 
in PT is an extremely complicated combinatorial exercise |73| . 



Nevertheless, it can be given a lucid physical interpretation, if we change 
our point of view. Why should we insist of doing the loop integrals first and 



subtract divergencies later? As we have seen in (|A.4|) and ( |A.6| ), the leading 
logarithmic dependence on the renormalization scale can be recovered from 
the dependence on the cut-off by dimensional analysis. Wouldn't it be easier 
to calculate the dependence on the cut-off directly? 

As we shall see now, this heuristic argument can be made precise and 
results in a powerful calculational tool, the RG. The essence of the method 
lies in deriving a differential equation, the Renormalization Group Equation 
(RGE), that can be integrated to give the desired result. It will turn out 
that the calculation never involves any infinite numbers and in most cases, 
the coefficients of the PT will never be large. 
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Figure A. 7: Wick rotation 



A. 2. 3 The Sliding Cut- Off 

For simplicity, I will now concentrate on massless 4 -theory. This is the 
theory of a spinless, uncharged, massless particle, that interacts through a 
local 4> vertex. The arguments will go through for more realistic QFTs, but 
the technical complications arising from gauge invariance would obscure the 
point that I'm trying to make. I will deal later with the effects from massive 
particles. 

Let us assume for the moment that the theory has been defined with a 
cut-off in momentum space, i. e. a propagator 

z D A (k 2 ) = —^—Q(\k\ < A) . (A.21) 

The cut-off | A; | < A needs some explanation, because it appears not to be 
Lorentz invariant. In fact, G(|A;| < A) is just a symbolic notation. A simple 
minded cut-off like \k 2 \ < A 2 is invariant, but it is not restrictive enough, 
because k can grow along light-like directions (k 2 = 0) without being cut off. 

A more appropriate definition can be found by using the "Wick rotation" . 
As can be seen from figure [A. 71 , in each loop integration, the integration 
contour in the complex kn plane can be deformed from the dashed curve 
to the dotted curve without crossing singularities. Then we can perform 
the substitution (k°, k) = (ik E , and the Minkowski "length" becomes 
a Euclidean length k 2 = (k ) 2 — k 2 = — (k E ) 2 — k 2 = —k 2 E . The Lorentz 
invariant cut-off k 2 E < A 2 is now effective, because there are no light-like 
directions in Euclidean space. We shall interpret 0(|fc| < A) in this fashion. 

We can now compare the theories defined with two different cut-offs A 
and A' < A. It turns out that we can change the theory in such a way 
that the physics does not change when we lower the cut-off, as long as the 
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external momenta remain below the new cut-off. To see how this works, let 
us introduce a graphical notation 

iD A (k 2 ) = (A.22a) 

iD A ,(k 2 )= (A.22b) 

iD A (k 2 ) - iD A ,(k 2 ) = ^=^= (A.22c) 

for propagators in the full theory, the low energy theory and their difference. 
For the one-loop correction to the propagator we find below the new cut-off 

___Q___ = ____£'___ +____Q___ • (A.23) 
Therefore the physics will not change, if we introduce a new vertex 

into the low energy theory. When we calculate ( A.24Q explicitely, it will be 
useful later to calculate the more general integral 

r r\ D k ( 1 \ n 

I(n, D) = / — — 0(A < \k\ < A) , (A.25) 



(2n) D \k 2 + ie 

defined in D space-time dimensions. As discussed above, we will perform a 
"Wick rotation" 

I(n, D) = J^{£) ©(A' 2 < 4 < A 2 ) • (A.26) 

The volume of the surface of the D-dimensional sphere is given by 

2ir D/2 

dSl ° = fm (A ' 27) 

and therefore 

which can be calculated explicitely 

(-l) n i A 2 

I<D ' 2 - D)= (4 T )W( C /2) ' n A^ (A ' 29a) 
^ *)U/, - {i Jo,^ /2) D ,l_ n (A°- - A'—) • (A.29b) 
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In the case of the tadpole diagram ( |A.24| ) we find finally 



/ (Sro e(A ' - - A) = i/(M) = w? ( A ' 2 - A ' 2 ) (A ' 30) 

for the new vertex. In the four point function 

✓ \ ✓ \ * \ / ~~ \ ✓ \ 

/ \ / \ / \ / \ / \ 

(A.31) 

we need more vertices. First a modified four particle vertex 

Under the assumption that the external momenta satisfy \p\ < A' < A, we 
can do the loop integral in ( A.32|) easily to leading order 



1 f d 4 k i6(A' < \k\ < A) i&(A' <\p-k\<A) 



2 J (2vr) 4 k 2 + ie (p - A;) 2 + ze 



1 2 ln^ + O^l . (A.33) 



2 16vr 2 A' 2 V A2 

The exact expression is more complicated, due to the finite integration limits, 
but we will not need its explicit form. More important is the observation that 
all integrals exist, since they are over a compact domain in Euclidean space, 
far from any singularities. Below, after motivating the physics, we shall see 
how to perform calculations more efficiently Anyway, there are three such 
diagrams and the new vertex is 

3 1 A 2 n 2 
--—^ln^^-(f)\x). (A.34) 
2 16vr 2 A' 4! v ; 

There is also a new six particle vertex 



(A.35) 



that reads (again neglecting small terms 0(p 2 /A 2 )): 



2 



T^ ix) - (A ' 36) 



in 



The general procedure consists in a stepwise reduction of the scale A' 

A > A' > A" > A'" > . . . (A.37) 

where the Lagrangian is adjusted in each step such that the low energy 
observables below the current cut off scale is unchanged. If we allow arbitrary 
powers of the field <fi and derivatives, the low energy physics can indeed be 
reproduced fully as shown above. 



A. 2. 4 Renormalization Group Flow 
The most general interaction can be expanded 

L(x) = Y,9iOi(x) (A.38) 

i 

in an infinite series of local operators 

0(x) = (0 2 (x), 4 (x), (<90) 2 (x), 6 (x), (0<90) 2 (x), . . . } , (A.39) 

where an infinite sum will not necessarily correspond to a local interaction. 
A discrete renormalization group transformation of the scale dependent cou- 
pling constants 

9l (A) - ft (AO = £r<,-(A', A)#(A) (A.40) 
i 

is defined by the requirement that the low energy physics remains unchanged. 

However, discrete transformations like (|A.40|) are technically hard to han- 
dle. It is much more transparent to consider a continuous transformation 
with infinitesimal generators instead. Then the RGE is a system of coupled 
non-linear ordinary differential equations 

!dft(A' 



As a concrete example, consider the evolution of the coefficients g± and g$ 
L int (x) = |0 4 (x) + |0 6 (x) + . . . (A.42) 
with a change of the cut-off scale A', as depicted on the left hand side of 



figure A.8. By construction, the low-energy physics does not change along 



the trajectories in figure |A.8|, as long as each coupling uses the same A'. 
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A'/A g 4 

Figure A. 8: The evolution of the coefficients g± and g$ as a function of a cut- 
off scale A' (left hand side) and after elimination of the unphysical cut-off 
scale A' (right hand side). 



The cut-off scale A' is therefore just conventional and determines what is 
considered to be part of the lowest order Lagrangian and what is to be 
calculated from it in higher orders. Thus the scale A' is redundant and can 
be eliminated from the physical parameter space. This physics is therefore 
not determined by a point in the parameter space, but by a whole trajectory 
instead. 

Since A' is redundant, it is desirable to eliminate it completely. As long 
as (?4 (A') can be inverted (which is always possible in fixed orders of PT), 
the physics on the left hand side of figure |A.8| can be recast as the right 
hand side of the same figure, where the dimensionful cut-off scale A' has 
be traded for the dimensionless coupling parameter g^. This is an example 
of the celebrated phenomenon of dimensional transmutation ||149|| , of which 



the relation of the strong coupling as and the QCD scale Aqcd is the most 
famous example. 

So far, we have ignored the dependence on the upper cut-off scale A, which 
was assumed to be fixed and much larger than any physics scale. After the 
introduction of the RG trajectories, the scale A obtains a straightforward 
interpretation as the starting point of the trajectory. In this interpretation, 
the low energy physics will not depend on A if the RGE can be solved for 
all A. However, the existence of global solutions is not a trivial condition 
and there are several cases to consider. 

In the scenario depicted on the left hand side of figure |A.9| , all trajecto- 
ries extend to A' — > oo. Therefore, the starting point of the evolution can 
be pushed back to A — > oo, which means that the cut-off can be removed 



113 



exp. 

A' 



exp. 

A' 



Figure A. 9: All trajectories on the left hand side extend to A' — > oo and a 
continuum limit exists. All trajectories on the right hand side even remain 
perturbative for A' — ► oo. 



altogether and a continuum limit exists. However, one must be careful that 
the trajectories might lead through non perturbative regions with g ^> 1 and 
that a perturbative determination of the evolution equation is unreliable in 
these regions. 

Even more favorable is the scenario depicted on the right hand side of 
figure |A.9| , where all trajectories that start in the low energy regime, where 
couplings are determined experimentally, remain perturbative for all A' — > 
oo. Perturbative calculations are reliable everywhere in this scenario. The 
most favorable special case of this scenario is asymptotic freedom, [|144|1 in 
which the couplings vanish for A' — > oo and low orders of PT provide reliable 
predictions for high energy experiments. 

In stark contrast to the previous two scenarios, no trajectory on the left 
hand side of figure |A. 10| can be extended from the low energy domain beyond 
the scale A . Thus the QFT that describes the present experiments can not 
be valid above Ao, unless non-perturbative effects change the trajectories 
significantly close to A . Such a scenario is not entirely bad news, because it 
predicts that something "interesting" will happen around A : at least strong 
interactions, maybe even "new physics." 

An interesting combination of the scenarios in figures |A.9| and |A.10| is 
shown on the right hand side of figure A.10| : some trajectories can be ex- 
tended to A' — > oo, while others are confined to A' < A . Typically, there 
will be a critical coupling go(A') and trajectories with g(A') < go(A') can be 
extended, while trajectories with g(A') > go(A') can not. As a result, there 
will be upper limits on the couplings at any given scale, if the theory is in- 



114 




Figure A. 10: The low energy theory on the left hand side can never be valid 
above A . The low energy theory on the right hand side can be valid above A 0; 
but only for a restricted part of the parameter space. 



tended to be defined at arbitrary high scales. The most famous example for 
this phenomenon is provided by the upper limits on the Higgs self coupling 
resulting in an upper limit in the Higgs mass m 2 H = g/2 ■ (0) 2 in the SM ||150|| . 

A. 2. 5 Relevant, Marginal Or Irrelevant? 

While the graphical representation or the RG flow is very suggestive for a 
few couplings, the untruncated problem has to deal with infinitely many 
couplings. Therefore, it is not obvious that the intuition gained from the 
low-dimensional examples can be used in the infinite dimensional space of 
couplings. This is particularly problematic, because operators of higher and 
higher dimension produce more and more divergent contributions to Feynman 
diagrams and need even higher dimensional counterterms (see section |A.1.2|) . 

Fortunately, "the first shall be last and the last shall be first" and the 
higher dimensional operators turn out to be harmless from the point of view 
of the RG. If there was no interaction, the couplings would be constant. In 
this case, we can estimate matrix elements of the operators of dimension d 
by dimensional analysis as (in four space-time dimensions) 




(A.43) 



where A is the high energy scale at which the "bare" couplings are defined 
and d is the power counting dimension of the operator, counted according to 
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(I A . 9 b| ) and ( |A.10| ). In ( |A.43| ), we can identify three different cases according 
to the value d 



• d < 4: these operators are termed relevant, because their contributions 
are becoming more important at low energies, where experiments are 
performed. 

• d — 4: these operators are termed marginal. 

• d > 4: these operators are termed irrelevant, because their contribu- 
tions are becoming less important at low energies, where experiments 
are performed. 

Therefore, the initial conditions at high energies for higher dimensional ( "non 
renormalizable" ) interactions play no important role in low energy physics. 

Unless there are very strong interactions, relevant and irrelevant operators 
will retain their property, even if interactions are switched on. This can be 
seen from the evolution 

A' 2 ^ = A' 2 ^(A') + 
dA' loir 2 

2 d^V) = 3 1 2 , 
dA /2 2 16vr 2y4V ; 

a' 2 ^ = - 2S m +■■■ 

dA' 2 A' ; 
of the low dimensional operators: 

• d = 2 (i.e. #2 or m 2 ): the rate of change of this coupling is of dimen- 
sion 2: 0(A' 2 ). Unless there is significant fine tuning of the initial 
conditions, the value of the coupling will be of the order of the high 
energy starting point of the evolution: g 2 = 0(A 2 ) ^> A' 2 . 

• d — 6 (i. e. go) : the rate of change of the coupling is of dimension —2: 
0(A' -2 ). Therefore the value of the initial condition at the high en- 
ergy scale is indeed irrelevant with respect to the value induced by the 
evolution. 

To avoid a possible misunderstanding induced by the technical term "irrele- 
vant operator" , it is important to stress that the contributions of irrelevant 
operators do not necessarily vanish in the infrared. Instead, the dependence 
of these contributions on their initial conditions at high energy vanishes, 
because the RG trajectories flow together in the infrared, as shown in fig- 
ure PCTT| 



(A.44a) 
(A.44b) 
(A.44c) 
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Figure A. 11: Contributions of irrelevant operators do not necessarily vanish 
in the infrared, but the dependence of these contributions on their initial 
conditions at high energy vanishes in the infrared. 



Thus the initial conditions for the irrelevant operators have no effect on 
the results of low energy experiments. In fact, the coefficients of the irrelevant 
operators can be set to zero at the high scale without changing the observable 
physics at lower energies. Therefore, the phenomena can be described by a 
renormalizable theory. 

To avoid another possible misunderstanding, this observation does not 
"prove" that only a renormalizable QFT can describe the observed phenom- 
ena. It only states that there is always are renormalizable QFT that is 
indistinguishable from a non-renormalizable theory at low energies. Since 
renormalizable theories are technically more convenient and depend on less 
parameters, common sense and Occam's razor suggest to prefer the renor- 
malizable QFT with only relevant or marginal interactions at the high scale 
over the others with the same infrared behavior. 

The technical advantage of renormalizable theories lies in the fact that, 
for a given set of conserved quantum numbers, there is only a finite number 
of relevant or marginal operators, which is even quite small. This simplifies 
calculations and increases the predictive power of these theories because fewer 
free parameters have to be fitted in experiments. 

In fact, from the point of view of the RG, the relevant ("super renormal- 
izable") operators are more problematic than the non renormalizable ones. 
The most prominent examples are masses of light particles. Unless there is 
a symmetry that forces the mass to be small, most trajectories will lead to 
a value of the mass of the order of the high energy cut off scale, which is 
incompatible with the assumption that the particle is light. The selection of 
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Figure A. 12: Suppressed contributions to the (3-function for g±, which 
are 0(A~ 2 ). 



a trajectory that flows towards a light mass in the infrared requires unnatural 
fine tuning in this case. 

Fermion masses can be protected by a chiral symmetry and need no fine 
tuning. The same applies to the Goldstone bosons of a spontaneous sym- 
metry breaking. Goldstone's theorem protects their masses as well. Other 
scalars need to be related to fermions through a supersymmetry in order to 
be protected against a naturally large mass. 

The marginal operators are the most interesting from the theoretical point 
of view. Without interaction, their contributions do not change with the 
scale. Therefore even weak interactions can make the marginal couplings 
grow strong or become weak asymptotically. From the RGE for g± 

A' 2 ^ = /^ 4 (A')) (A.45) 

it can be seen that the behavior of the coupling depends on the sign of the 
/3-function 

• (3 > (as in the scalar example considered above: (3 = 3gf /(327r 2 )): 
the coupling grows strong in the ultraviolet and becomes weak in the 
infrared, 



/3 < (e.. g. QCD, see section |A.2.9|) : the coupling grows strong in the 



infrared and becomes weak in the ultraviolet. 



The solution of ( A.45 ) for our scalar example is 

04(A) 

1 + ^4(A)ln A 



S4(A') = ; , / 4V ;L A2 • (A.46) 



Figure [A.12| displays more contributions to the /3-function for g±, but all these 



contributions are 0(A -2 ) and can be ignored in leading order. 

If the interactions are strong enough, relevant operators can become 
marginal or even irrelevant and vice versa. However, this is impossible in 



118 



the perturbative domain and therefore outside our present scope. The non 
perturbative calculation of path integrals via the solution of a RGE is in 
principle possible, but technically very complicated. The known solutions 
require a truncation of the RGE and the systematic error introduced in this 
fashion is very hard to quantify. 



A. 2. 6 Leading Logarithms 

In addition to providing an explanation for the fortunate circumstance that 
nature seems to be described by a renormalizable QFT, the study of the 
RG flows provides a more reliable calculational tool than the direct pertur- 
bative evaluation of Feynman diagrams. 

In theories with widely separated mass scales, PT suffers from large coeffi- 
cients in the perturbative series that are caused by logarithmic integrals from 
one mass scale to the next. In contract, the coefficients of the RGE never 
contain more than one scale, which is determined by the slice of momentum 
space that is being integrated out. Thus these coefficients can not contain 
large logarithms and the theory remains perturbative as long as the cou- 
plings are weak. Observable large logarithms are recovered only in the final 
stage of the calculation, the solution of the RGE. Therefore, large unphysical 
logarithms that cancel in observables can never appear in intermediate steps. 



A. 2.7 Callan Symanzik Equations 
The calculation of integrals over a small slice of momentum space like (|A.25|) 



or ( |A.33| ) is technically demanding, because the finite integration limits make 
it impossible to use many of the standard tricks of the trade, like the shifting 
of integration variables. In addition, fixed cut-offs manifestly violate gauge 



invariance, which makes their practical application in gauge theories ||151|| 
very awkward. 

At this point, the relation of the RG flow to perturbative renormaliza- 
tion comes in handy. A change of the lower cut off scale A' corresponds to 
a change of the Lagrangian that keeps the physics unchanged. In pertur- 
bative renormalization, the renormalization scale fi plays a similar role as 
a scale that characterizes how much of the interaction is absorbed into the 
Lagrangian in the form of local counter terms. Therefore, it is tempting to 
identify A' with fi. 

The analogy is not complete, however. The hard cut off A' is replaced by 
counter terms that correspond to a soft cut off, which is switched on gradu- 
ally. This is the price to pay for gauge invariance and technical convenience. 
If we were to sum all orders of the perturbative series, both methods will 
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give the same result. For a truncated series, however, differences of higher 
order will remain. 

Consider a general n-point function 

G {n) (x 1 ,x 2 ,... ,x n ;g,n) = (O|T0(xi)0(x 2 ) . . . <j>(x n ) |0) , (A.47) 

which is renormalized at //, i. e. the invariants of the momenta in the renor- 
malization conditions are equal to /i. A change of \l will not change the 
physics. Instead, the theory is shifted to another point on the same trajec- 
tory. Therefore, there is a new value g' for the renormalized coupling, such 
that only the unobservable normalization of the field operators is changed 4>(x) 
l/VZ<j>(x). Thus 

G< n >(xi,x 2 ,... ,x n ;g',n') = Z^ 2 ^, (x u x 2 , . . . ,x n ;g,n). (A.48) 

As above, it is more convenient to replace the discrete RG transformations 
by continuous transformations with infinitesimal generators 

f i'^-j(Z n / 2 ^, f i')G^(x 1 ,...,x n ;g',^))=0 (A.49) 

where d/d/i' is a total derivative, that includes the change of the coupling g 
as well. Therefore, the Callan-Symanzik Equation (CSE) reads 

^ + (3(g)^ + n 7 (<?)J G^( Xl , ...;g,(i) = 0, (A.50) 

with 

13(g) = H^-g(ji) (A.51a) 

i{g) = — sV-t- z (vo, y) ■ (A.sib) 

The /^-function is dimensionless. In a massless theory, it can therefore not 
depend on // for dimensional reasons. In theories with masses, there are 
always renormalization schemes that retain this property [ 153 |. In other 
schemes, (3 can depend on ratios of fi and masses. 

However, it is not obvious that 7 does not depend on /i through the 
ratio /i//io- In renormalizable theories, the limit /j,q — > 00 exists and 7 can 
not depend on fiQ. As we have explained in section [A.2.5| above, we can 
always chose a trajectory that corresponds to a renormalizable theory and 
drop the dependence of 7 on /i . 
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The actual calculation of the coefficient functions (3 and 7 proceeds by cal- 
culating a sufficiently large set of renormalized Green's functions and finding 
functions (3 and 7 such that ( |A.50p is satisfied. The non-trivial statement of 
the Callan-Symanzik Equation (CSE) is that one universal function /3 for each 
coupling and one universal function 7 for each field suffices to satisfy (|A.50|) 
for all Green's functions. 

If there is more than one coupling and more than one field, the general- 
ization is obvious 

d k d m \ 

^ + z2^)^.+22 n ^)jG ( - ni '---' nm) {x 1 ,... ,g k ,u) = 0. 

(A.52) 

For example, the QED vertex functions obeys the CSE 

u— + (3(e) — + 2^(e)+ lA (e)j {Q\T$(x x )$(x 2 )A v {xs)\ti) = . (A.53) 

A. 2. 8 Running Couplings and Anomalous Dimensions 
As a concrete example for solving the CSE, consider a four-point function 

P2 P3 

G^{P;g,u) = =F.T.(O|T0(x 1 )0(x 2 )0(x 3 )0(x 4 )|O) (A.54) 
Pi Pi 

in the deep Euclidean domain (this does not correspond to a physical ampli- 
tude, but can be part of one): 

p\ = -P 2 (A.55a) 

PiPj = . (A. 55b) 

Lowest order PT gives for the connected (but not amputated) part 

G^(P;g,u) = (-ig) + (g 2 ). (A.56) 

Ignoring corrections from mass terms, dimensional analysis predicts that to 
all orders in PT 

G^(P-g,u)= f^\ 4 G^(P/u;g), (A.57) 
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i. e. that is a homogenous function and solves the differential equation 

^G^\P-g^) = - U + G^\P-g^) . (A.58) 

Plugging ( |A.58| ) into the CSE ( |A.50| ), we can trade the dependence of the 
renormalization scale for the momentum dependence and obtain the Renor- 
malization Group Equation (RGE) for the four point function 



(A.59) 



The RGE can be solved with standard techniques for the integration of a 

Partial Differential Equation (PDE). The introduction of a running coupling 
constant g(P; g) 

P^pg(P;g)=P(g(P;g)) (A.60a) 

g(ji;g)=g (A.60b) 
absorbs the differential operator 

-/%)!- W;s) = (A.61) 



dP y *'dg 
and the general solution can be written as 

G {A \P;g^)={^) G (4 \g(P;g))-exp 4 J <V^J, (A.62) 

where is an arbitrary function of one variable, which is not determined 
by the RG. However, matching the solution (A.62) of the RGE to the per- 
turbative result ( |A.56|) at P — [A gives 

G {i \g) = -ig + 0{?). (A.63) 

In general, we can identify two elements in the solution of the RGE. One 
part of the solution is given by the perturbative result at a given order with 
the coupling constant replaced by the running coupling at the appropriate 
scale. In addition, there is an exponential factor modifying the scaling be- 
havior of each field in the n-point function under consideration. Since the 7 
appear in the numerator of the exponent, they are usually called anomalous 
dimensions. 
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From the /3-function for 4 



167T : 



;9 2 + 0(g 3 



(A.64) 



follows the running coupling 



g(P;g) 



1 - 



16"7T- 



-g In 



(A.65) 



The solution ( |A.62|) has the nice feature that all large logarithms ln(P 2 / /j, 2 ) 
are collected in two universal places, that are common to all Green's func- 
tions: the running coupling g(P; g) and the exponents of the scale factors. 
Thus the large logarithms have been resummed successfully in universal 
quantities and PT is reliable, as long as g(P; g) is small. 

Unfortunately, this procedure does not work as straightforwardly for all 
Green's functions. The RGE is much harder to solve if not all external 
momenta scale uniformly. If there is more than one relevant mass scale, the 
coefficient functions themselves can contain large logarithms. The classical 
example for large logarithms that can not be obtained from the RG are the 



Sudakov double logarithms | |154|| in exclusive scattering processes 
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(A.66) 



m 



A. 2. 9 Asymptotic Freedom 

As discussed earlier, the behavior of the /3-function in the vicinity of g = 
determines the properties of PT. For a single coupling, the /3-function must 
vanish for g = 0, because without interactions, there is no mechanism that 
could induce running of the coupling. 

Figures |A.13| and [A.14| display the four possible different scenarios. On 
the left hand side of figure |A.13| , we have (3 > everywhere and the running 
coupling grows in the ultraviolet without limits. However, PT is reliable in 
the infrared. 



On the right hand side of figure |A.13j , we have (3 < everywhere and 
the running coupling grows in the infrared without limits. PT is reliable 
in the ultraviolet, but problematic in the infrared. The theory of strong 
interactions, QCD, has this property | |144 |, called asymptotic freedom and it 
is known that non abelian gauge theories are the only class of theories in four 
space-time dimensions that can exhibit asymptotic freedom. 
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Figure A. 13: (3 -functions for infrared free (left) and ultraviolet free (right) 
theories. The arrow denotes the direction of the evolution for a rising scale. 




Figure A. 14: f3 -functions with ultraviolet (left) and infrared (right) stable 
fixed points. The arrow denotes the direction of the evolution for a rising 
scale. 



In the left hand side of figure A. 14, the coupling grows in the ultraviolet, 



but the /3-function makes a turn and the coupling becomes constant after 
running into an ultraviolet stable fixed point at g*, because (3{g*) = 0. The 
opposite phenomenon appears in the right hand side of figure |A.14| , where g* 
is an infrared stable fixed point. 

A. 2. 10 Dimensional Regularization 

As mentioned above, the RG functions f}(g) and 7(g) are universal. They are 
determined by calculating fidG^" 1 ' / d/j, for a sufficient set of and solving 
the CSE for (3(g) and 7(0). 
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A typical integral appearing in these equations is 



^ M '^"/g lf -S' +ir < A - 67 > 

where the factor /i 4_D has been added in order to make the mass dimension 
of / independent of the dimension of space time. As long asm — n — D/2 
and D/2 + n are not equal to negative whole numbers, the integral can be 
evaluated 

I n , m {D,M 2 ) = 

(-l) n+m ^ M ^ 2 +n- m ( M 2 \ D/2 - 2 T{m-n-D/2)T{D/2 + n 



16vr 2 v ' V 167r V7 T(m)T(D/2) 

(A.68) 

and the logarithmic divergency of /o,2(4, M 2 ) shows up as the pole of the Euler 
T-function. This divergency is regularized by moving away from D = 4 

/ , 2 (4-2 e ,M=) = I ^( I ^)" , r (e ). (A.69) 

T(e) can be expanded in a Laurent series, that starts with a single pole 

4 2 (4-2 e ,M 2 ) = I ^Q + ln^ + 21n4vr- 7i; ) . (A.70) 

In a massless theory, the dependence on /i must be identical to the depen- 
dence on the renormalization scale and the expansion of the T-function shows 
that it suffices to determine the residues of the poles in e = 2 — D/2. Even 
in massive theories, there are renormalization schemes that retain this con- 
venient property [p.53|| . 

This procedure of dimensional regularization ||143|| and minimal subtrac- 



tion of the poles in 1/(D — 4) is particularly useful in gauge theories, be- 
cause gauge invariance is not violated by the regularization and no additional 
counter terms are needed in order to maintain gauge invariance. Since gauge 
invariance softens divergencies, dimensional regularization avoids the overes- 
timation of radiative corrections induced by a naive application of regular- 
ization procedures that violate gauge invariance. 
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Figure A. 15: The contributions of the vertex function and the self energy to 
the (3 -function in QED cancel. 



A. 2. 11 Gauge Theories 
In QED, we find for the lowest order RG functions 



[3(e) 
1a (e) 
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(A.71a) 
(A.71b) 
(A.71c) 



There is a peculiarity of QED: the Ward identity enforces that the contri- 
butions of the vertex function and the self energy (see figure |A.15| ) to the 
/3-function cancel to all orders and the /3-function can be calculated from the 
vacuum polarization (figure |A.3| ) alone. 

From (|A.71a|) , we see that /?qed > and that QED is not asymptotically 
free. Higher order calculations have not revealed an ultraviolet fixed point 
either. 

A more complicated calculation in a non-abelian SU (Nc) gauge theory 



with Nf quarks has the result ||144 



16tt 2 



where 



2 3 



2N, 



c 



(A.72) 



(A.73) 



In QCD (i.e. Nc = 3), this means 

/3qcd(#) 



, . -N f - 11 

16vr 2 V 3 1 



(A. 74) 
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Figure A. 16: Non abelian contributions to the vacuum polarization: gluon 
and ghost loops. 



and the theory is indeed asymptotically free for Nf < 33/2. As a result, the 
running coupling of QCD falls with rising energy: 

4:71 

where 0.15 GeV < A QCD < 0.25 GeV and (3 = 11 - 2N f /3. 

The most obvious difference to an abelian gauge theory like QED is the 
existence of the genuinely non-abelian contributions to the vacuum polar- 
ization displayed in figure [A.16j . These diagrams do indeed give a negative 
contribution to the /3-function. 

Intuitively, the negative contribution from the gluon and ghost loops gives 
a fair description of the interplay of the positive quark contribution and the 
negative non-abelian contributions. Strictly speaking, however this expla- 
nation is not correct, because the cancellations of vertices and self energies 
familiar from QED are not operative and more complicated diagrams also 
give contributions to the /5-function. 



A. 3 Effective Field Theory 

Ein Bild erhdlt ja gerade dadurch seine 
Brauchbarkeit, dafi es nicht die Sache ist. 
[EGON FRIEDELL: Kulturgeschichte Griechenlands] 



A. 3.1 Flavor Thresholds and Decoupling 

So far, all effects of particle masses have been neglected, in order to highlight 
the simplicity of the RG approach. This is often a very good approxima- 
tion. For example at LEP2, fermion masses can be neglected in four fermion 
production, unless one of the final state particles is an electron and a t- 
channel singularity has to be regularized. The massless approximation is a 
good approximation, because the LEP2 center of mass energy is far from all 
flavor thresholds: m b <C v /slep2 < m t- Top quarks can not be produced and 
m b /M w is a small number. 
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Close to a threshold, the RG loses some of its appeal, because the value 
of the threshold introduces a second scale that enters the calculation of the 
RG functions and makes calculations more complicated. However, Effective 
Field Theory (EFT) [ |155|| provides a systematic way of dealing with flavor 
thresholds in a RG calculation. 

Recall the /3-function of a SU(iVc) gauge theory ( |A.72| ). The quarks 



loops in the gluon vacuum polarization induce a dependence on the number 
of quark flavors 

P vfiJuyL^^QQfl. oc J^Cf-J- {A.76) 
and in a concrete calculation we have to answer the question for the value 



of Nf. This question can not be answered by counting the quarks in table [TT 
alone. To see this, consider the typical case mf <C —p 2 <C m t and m\ <C 
/i 2 <C m t . Then the contribution of a top quark loop in ( |A.76| ) is 



nV)=0(4>4) (A.77) 
mf mf 

and is very small below the threshold. 

The purpose of the RG is to systematically resum the large logarithms 
in the vacuum polarization and other diagrams, but we find that there are 
no large logarithms below the threshold, as long as the renormalization scale 
is chosen sensibly. Therefore, the top flavor must not be included in (|A.76|) 
below threshold and Nf = 5 is the appropriate value between the bottom 
and top thresholds. 

The fact that a heavy particle does not contribute to RG functions below 
its threshold is non-trivial. In the unrenormalized version of ([A.76 ), each 



flavor contributes equally to the high energy tail of the loop integral, as long 
as the cut off is significantly above the threshold of this flavor. Only after 
renormalization, it can be seen that each flavor also contributes equally to 
the counter term and that the net contribution of a heavy flavor vanishes, if 
the Green's functions are renormalized below the threshold of this flavor. 

Since the RG never deals with the loop integrals over all of momen- 
tum space, the cancellations described in the previous paragraph take place 
automatically. The /5-function describes the rate of change of the running 
coupling and if only active flavors contribute to this change, the cancellation 
of contributions from heavy flavors and their counter terms is already mani- 
fest. This argument, which has been made heuristically using ( |A.76| ) for the 



/^-function, can be made rigorous in the decoupling theorem [ p.56|| ; also for 
the anomalous dimensions. 
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However, there is one circumstance under which the effects of a heavy 
particle can be felt in the RG flow below its threshold. This the case if 
the large mass violates a symmetry of the theory. The most celebrated 



example of this effect is the breaking of the SU(2) C custodial symmetry [43] 
of the SM through the heavy top mass and the corresponding contribution 
to the p-parameter. In a perturbative calculation with masses, there is a 
contribution to p that is proportional to the quadratic mass splitting in the 
third generation 



n — iiij. 



U w+w -(q 2 )= 4 f oc * b + ■ ■ ■ (A.78) 




7T M? 



w 



If every SU(2)l doublet was degenerate, the decoupling theorem would ap- 
ply and each heavy doublet would decouple from the low energy phenomena. 
The resulting low energy physics could be described by an EFT built from 
SU(2) c -invariant interactions. If, as in the third generation of the SM, there 
is a strong lifting of the degeneracy, there will be an effect on the low en- 
ergy couplings that manifests itself as a p-parameter that is different from 
unity. Then the low energy EFT below the top threshold has to contain 
SU(2) c -violating operators. Below the bottom threshold, there are no new 
SU(2) C violating contributions from the fermion sector, but the effects accu- 
mulated between m t and rrib remain. If there would be no reason to expect 
a custodial symmetry or p = 1, the low energy EFT would contain SU(2) C - 
violating operators from the outset, of course. 

The general result of the decoupling theorem is that, unless symmetries 
are violated, heavy particles can be removed from a theory, if only phenomena 
below the energy scale given by the particle's mass are to be calculated. The 
only residual effect of the heavy particle remains in a renormalization of 
coupling constants and fields. 



A. 3. 2 Integrating Out or Matching and Running 

The decoupling of heavy flavors discussed in the previous section provides 
a systematic calculational procedure for theories with more than one mass 
scale. For example, in order the calculate the strong coupling at low energies 
from an experimental measurement at a 1 TeV LC, one proceeds as follows: 

1. start with the measured value go at po — 1 TeV 3> m t : g(po) = go 

2. solve the massless RGE g(p) with the /3-function for six active flavors 
from p = p down to p = m t 
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Figure A. 17: Running couplings with thresholds. In an asymptotically free 
gauge theory, the running is slowed down at each new flavor threshold. 



3. start a second evolution step at fx — nit, using the result of the first 
step as initial value 

4. solve the massless RGE g(fi) with the /3-function for five active flavors 
from fi = m t down to fi = m^. 

5. start a third evolution step at fi = mb, . . . 

6. and so on 

This procedure is known matching and running and is illustrated in fig- 
ure [A.17| . The matching at each threshold is continuous, but not differen- 
tiable, because the rate of change (i. e. the /5-function) changes discontinu- 
ously at each threshold. 

The approximation of the flavor thresholds by step functions is not op- 
timal, of course. However, for many observables, the detailed features of 
the threshold are much less important than the large logarithms between 
thresholds and the matching and running procedure provides a systematic 
way of resumming these logarithms. In any case, the calculations can be im- 
proved systematically by including higher orders (NLO, NNLO) in the loop 
expansion, as well as power corrections. 



The most famous example (already mentioned in section |1.2| ) for a thresh- 



old effect is given by the running coupling of the three SM gauge groups. The 



experimentally determined couplings do not meet |To[ in one point as would 
be a necessary condition of a GUT. However, matching the evolution of the 
couplings in the SM to the evolution of the couplings in the MSSM at the 
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threshold provided by the supersymmetry breaking scale /j, ~ 1 TeV causes 
the three couplings to meet [0. This may be an accident, but it is the most 
suggestive phenomenological evidence for supersymmetry so far. 



A. 3. 3 Important Irrelevant Interactions 



We have argued in section |A.2.5| , that operators of dimension greater than 
four are irrelevant and that the influence of their initial value at a high 
matching scale on the physics at much lower energies can be ignored. 

This argument can not be applied in two important cases. If the irrelevant 
interaction violates a symmetry that is respected by all relevant and marginal 
operators, the effects of the irrelevant interactions can be observable, even if 
they are small in absolute terms. A different problem arises if the hierarchy 
of scales is not large enough and the lever arm of the RG is therefore not large 
enough to suppress irrelevant operators when running from one threshold to 
the next. 

A typical example for the latter case is provided by the thresholds for the 
top quark and the W bosons. The power corrections Mw/m t m 0.45 can 
not be neglected relative to the logarithmic terms | \n(Mw/m t )\ m 0.8. The 
solution in cases like 5°_B°-mixing is to decouple the top quark and W boson 
simultaneously 

d b 

W 

+ tk "t = $ 




d 



b d 



M 2 W 
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(A.79) 
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where the function $ is calculated from the massive loop |157|| . Since the 
scales in the loop are rather close, no large logarithms can develop and the 
loop expansion for $ is well behaved. 

The | AS | = 2 operator discussed in the previous paragraph is also an 
example for first case, but the more common example is provided by tree 
level charged current interactions. Below Mw, the exchange of a W can be 
approximated by a local interaction 



W 





+ 



M 2 W 



of dimension six 



G - 

-J= ip(l - 75)7^ ^(1 - 75) W 



(A.80) 



(A.8i; 
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which is irrelevant. However, the Fermi interaction Lp can not be ignored, 
because it includes flavor changing terms that are absent in all relevant or 
marginal operators provided by the electromagnetic and strong interaction. 

The SM W /± -exchange is matched to the local Fermi interaction ( |A.81|) 
at n = M w like flOCp . Below the W -mass, all calculations are performed 
using the local Fermi interaction. The irrelevant operator (|A.81 ) is not renor- 




malizable in the traditional perturbative sense, because it requires counter 
terms of higher dimensions. In the EFT, this is no problem, however, be- 
cause these counter terms are all determined by matching conditions at My/. 
Consequently, there is no loss of predictive power. 

The radiative corrections can be summed below Mw by solving a RGE 
that includes the anomalous dimension of the irrelevant operator that is 
derived from diagrams like 



+ . . . 



A. 3. 4 Mass Independent Renormalization 

A useful intuitive picture for the matching and running procedure is that 
more and more physics from higher scales is integrated out as the renor- 
malization scale is lowered. The physics is not lost in this process, but is 
condensed from Feynman diagrams into the short distance interaction ver- 
tices. One can imagine this as if all momentum integrations would be carried 
out from above down to the renormalization scale and that the remaining 
low energy momentum integrations would be cut off at the renormalization 
scale. 

This intuitive picture is, however, not entirely accurate. As mentioned 
above, a fixed momentum space cut off is technically inconvenient, because 
of complicated integrals. But more important is that the fixed cut off is 
physically dangerous (see e.g. |50D, because gauge symmetries are violated 
by cut-offs. Therefore the dividing line between the short distance physics, 
which is absorbed in effective local interactions, and the long distance physics 
is gauge dependent and thus unphysical. Typically, the unphysical short 
distance coefficients will be spuriously large in magnitude, but their contri- 
butions will cancel in physical observables against unphysical long distance 
contributions. In order to exploit the cancellation of unphysical contribu- 
tions as soon as possible, it is preferable to extend all loop integrals over the 
whole momentum space and to replace the hard cut offs by counter terms. 
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The price to pay is small, because the resulting divergencies can be handled 
with traditional perturbative renormalization theory. Calculations in mass 
independent scheme like minimal subtraction are convenient and do respect 
gauge invariance. The physically important threshold effects are handled by 
matching. 



A. 3. 5 Naive Dimensional Analysis 

If the theory at the high energy scale is known, EFT is but a very powerful 
calculational tool. It allows to use the most appropriate degrees of freedom 
at each scale by matching different theories at thresholds. Furthermore, large 
radiative corrections can be summed systematically through the renormal- 
ization group. 

In such a scenario, the short distance coefficients are calculated by match- 
ing and running from the top to the bottom. The opposite case is when low 
energy data is fitted to an arbitrary effective Lagrangian and this Lagrangian 
is evolved from bottom to top until it becomes inconsistent, as shown in fig- 
ure |A.10| . In this case, reliable estimates of the scale of "new physics" can be 



made, but the "new physics" above the threshold can rarely be determined 
from the matching conditions at the threshold. 

Nevertheless, we can use EFT to estimate the likely size of contributions 
from "new physics" to observables at future experiments using Naive Dimen- 
sional Analysis (NDA) JL46 , |158| , |2T| . We know from general RG arguments, 



that the relevant operators will have larger coefficients at low energy than 
the irrelevant operators of the same symmetry. That this statement remains 
true after renormalization was first pointed out in ||146|| . Unfortunately these 



arguments alone give no indication of the absolute magnitude of the coeffi- 
cients. 

From ( |A.30| ) and ( |A.33|) we see that typical loop integrals that contain a 



single scale can be estimated by a factor of 

1 



16tt 2 



6.3 -10" 3 , (A.82) 



which is multiplied by a power of this scale as dictated by simple dimen- 
sional analysis. A systematic analysis ||146j , |158|| reveals that the small num- 



ber ( A.82 ) creates a hierarchy that can be used classify terms. 



One possible formQof the lowest order effective Lagrangian in the momen- 



4 There are many equivalent forms [ pl| , but all involve the field <j>(x) only in the com- 
bination <fi(x)/f. 
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turn expansion ||146|| for a non-linearly realized broken abelianP] symmetry is 



L = f tr {(D^UfD^U) (A.83a) 

where 

U(x) = e^ {x)/f (A.83b) 

and / is a scale introduced by the symmetry breaking. It corresponds to 
the vacuum expectation value of a Higgs field in a linear realization of the 
symmetry breaking. 

Loops renormalize the operators of a given order as well as higher orders 
in the momentum expansion. For example, a loop with two lowest order 
terms 

-i ((j)(x)%<j){x)} (A.84) 
renormalized at a scale /i 2 





167T 2 / 2 

where P 2 is an invariant built from the external momenta, requires a coun- 
terterm from the next order of the momentum expansion 



1 1 

A 2 ? 



-d u (j){x)d l ,(j>{x)d v (j){x)d> 1 (j){ X ) , (A.86) 



where A is the scale characterizing the momentum expansion. A priori, 
there is no relation between the scales / and A for symmetry breaking and 
momentum expansion, respectively. However, unless there is some unnatural 
fine-tuning of parameters at work, the rate of change of the couplings in the 



RG running from ([A.85 ) should be of the same order as the coefficients of 

(A.87) 



the counterterm ( |A.86j ). Equating coefficients gives 

1 1 1 



16vr 2 f 2 A 2 



5 In the general non-abelian case, the formulae are more complicated, because the factor 
group manifold G/H has to be parametrized These geometrical complications have 
no impact on the power counting argument. 
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or the celebrated formula 



A « Airf . (A. 



From another perspective, the RG will evolve towards ( |A.88[ ) in the infrared 



unless a symmetry that has not been made manifest in the effective La- 
grangian forces the RG flow in a particular direction. A physical thresh- 
old between / and A would require matching and provides the only way 
around (|A.88|) in an EFT. Such a physical threshold will however be observ- 
able independently. 

The heuristic argument for the lowest order in the momentum expansion 
for scalar fields can be extended systematically to all orders in the loop and 
momentum expansions and to fermions and vector fields ||158|| . The result is 



that a natural interaction involving scalars 0, fermions ip, derivatives d and 
gauge bosons A has the size 

L{N>m) = ^ (?) im) (t) " (!) < A - 89 > 

for any quartet (JV^, N^, Na, Nq) of natural numbers. The power counting 
rules assume that the vector boson A is indeed a gauge boson, such that 
the two parts of the covariant derivative d — ieA scale identically. The rules 
would be much more complicated for non-gauge bosons and their longitudinal 
polarization states. 

The important result of NDA is that the natural strengths ([A.89 ) of 



interactions can be estimated, even in a strongly interacting EFT. The as- 
sumption that couplings have their natural strengths is nothing more than an 
assumption, of course. Nevertheless, the predictions from NDA work for low 
energy strong interactions and NDA can be used as a guide while searching 
for good places for hunting "new physics." Higher dimensional operators are 
most likely suppressed by a factor of l/(167r 2 ) and proposals for experiments 
should take this into account. 
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